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CHAPTER I 



STATEMENT OF THE PROBLEM 



purposa of this gtudy was to soa whether differential treatroant 
of beginning teachers during their first year of teaching results In sig- 
nificant changes in either the classroom behavior of these teachers and/ 
or their attitudes towards the profession of teaching* 

Unlike other professional neophytes, teachers begin at the top# 

Th e teacher in her first dsy has exactly the same responsibilities as 
the most able and experienced teacher on the staff* The lack of a for- 
mal induction procedure for teachers seems to some researchers to account 
* for the very slow rate of professional growth of teachers* For example, 
Dan Lortie summarises the typical pattern of the beginning teachers* 
orientation in three stages: (a) a year or two of struggles to get 
through each day without major damage to students or self ? CB) a period 
of attempts to innovate (which usually attract unfavorable attention 
from ad mi nistrators)} and (e) crystallisation into conventional prac- 
tice* 1 Thus does each wave of new teachers arrive at the same stage of 
mediocrity# 

In the State of Washington the State Board of Education, which has 
legal responsibility for policies governing teacher education, appointed 
in I960 a special committee to recommend changes in standards for the 



^Lortie. Ban* “Teacher Socialization —The Robinson Crusoe Model” 

" ^ * . * # # M . • If##. IU UltAW 




ax TErs uoarerence# wasbuigoon um * amvwu** * r# iToiic 

tloa and ProfsBsional Standards, Jiatiowl Education Association, 1966, p. 59 



education of teachers* This committee, the Teacher Education Standards 
Revision Committee, was charged with responsibility of reconroiending chang- 
es in the teacher education program in the State of Washington. After 
a series of discussions, the committee recommended a study of the pos- 
sible effects of a major change in the treatment of the beginning tea- 
cher during the first mouths of service. It was the committee^ opinion 
that teachers adapt the procedures of other teachers with whom they 
associate during these first months, and only to a small degree apply the 
procedures that they have studied in their college teacher education 
programs. The committee also felt that it was very possible that many 
teachers form their basic attitudes towards their profession in these 
first months of service* The committee reasoned th*o a major change in the 
treatment of teachers should involve different teaching assignments 
and much more than the usual amount of inservice help for these beginners* 
Such an effort could make a greater effect on the behavior of the class- 
room teacher than any other effort in the entire teacher education 
program. 

Another consideration was the fact that in the State of Was h i n gton 
the first year of teaching is conceived of as part of the basic educa- 
tion of the teacher. After four years of study in an approved teacher 
education program, a teach®? graduates with the provisional certificate. 
The standard certificate is granted after two years of successful teaching 
and an additional year of college study* Although this beginning esperi- 
ienoe is thus a part of the actual training program, in fact, most school 
districts have rarely reported the two years of experience as successful 
without seriously evaluating the performance of the teacher, or making 
searching recommendations for furthor training of tha teacher. 
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Both researchers and practitioners strongly urge that the period 
of beginning teaching should be conceived of as the critics! stage in the 
development and refinement of the technology of classroom teaching* For 
example* the 1965 Conference of the National Commission on Teacher Edu- 
cation and Professional Standards; in "me Heal world of the Beginning 
Teacher^ 11 heard a variety of authorities and participants underline the 



form of internship instead of a full teaching assignment for the begin- 



At about the spme time the committee was making the recommendation 
to the State Board; the James H* Conaxxt report on teacher education was 
published* One of the major recommendations of the Ccnant report; recom- 
mendation number 11; seemed consistent with the recommendation of the 



The recommendation of the committee; and the basis for this project; 
was that the State should undertake an experimental study to determine 
whether or not a substantial reduction In the load of the beginning 
teacher; together with intensive in-service training; might affect the 
classroom behavior of these teachers and also affect their attitudes to- 
ward their profession* 

^Sgq National Education Association; National Commission on Teacher 
Education and Professional Standards* The Real World of the B eginoing Tea- 
cher» Report of the Nineteenth National Te»S Conference* Washington n»c*s 

2 Conant, James B« The Education of the American Teacher * McGraw- 
Hill Co*; 1963* 



need for new directions in teacher education which would focus on some 



ning years* 1 




OBJECTIVES AND HYPOTHESIS OF THE 3TUDT 



The specific objectives which guided this research were* 

1* To determine whether or not reduced loads of beginning 
teachers would affect the classroom behavior of these 
teachers* 

2m To determine whether or not reduced loads would affect 
the beginning teachers 1 attitudes towards teaching* 

2* To determine whether or not intensive inservice help for 
the beginning teacher would affect their classroom beha- 
vior* 

2u To determine whether or not such intensive inservice 

help would affect the attitudes of thsse beginning teachers* 



Secondary and related objectives of this study were: 

1* To determine whether or not personal or profesiional char- 
acteristics of beginning teachers modified the effects of 
reduced loads and intensive inservice training* 

2* To arrive at a defensible judgement about the amount and 
kind of help for beginning teachers necessary to affect 
these teachers behaviors* 

3* To determine whether or not it would be administratively 
feasible and experimentally reliable to have trained ob- 
servers utilize an evaluative chock list to appraise the 
behavior of the beginning classroom teacher* 



The foUsfeigg hypotheses are stated in the null form: 

1* There will be no significant differences in terms of selected 
aspects of teachers 1 classroom behavior between groups of 
beginning element ary t eschars who have a induction in load 
compared to other teachers who have no rt .-ced load* 

2* There will be no significant differences in terms of their 
opinions about teaching between groups of beginning teachers 
who have reduced load and those who have no reduced load* , 

3* There w ill be no significant differences in terms of selected 
aspects of teachers* classroom behavior among groups of be* 9 



$ 



ginning teachers who receive different types of inservice 
instruction during released time periods* 

U. There will be no significant differences in terns of their 
opinions about teaching among groups of beginning teachers , 
who receive different types of inservice instruction during 
released time periods® 
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CHAPTER III 



RELATED U2WJTWE 

Although there has been a great deal of’ research on teaching effective- 
ness, there has not. been a great deal of study related to the major assump- 
tions of this project* A major assumption of this study is that it is 
feasible to evaluate teaching effectiveness by observing a teacher in the 
act of classroom teaching, and further, that valid instruments exist which 
are sensitive to changes in teaching behavior as observed by trained 
educators. 

A study which has become a milestone in the research on teacher effect- 
iveness was directed by David G. Ryans. Ryans concluded that: 

Research (is needed) into the refinement of observing 
and assessing techniques, leading perhaps ta the development 
of behavioral check lists and forced-choice scales 
tc the end of providing more valid behavior-in-process 
criterion data. 

Ryans’ research was focused on pupil and teacher classroom behavior. 
Over 3UOO teachers Were observed and rated. Reliability coefficients 
were determined using paired- observers who made independent judgements. 

t 

Coefficients from ,$h to .86 were obtained. The m i n i m um sample of 
teachers observed was 1(3. 

Ryans’ behavior characteristics included: 

Pupil Behavior 
Apathetic - Alert 
Obtrusive - Responsive 
Uncertain - Confident 
Dependent - I' -itiating 



1 David G. Ryans* ‘•C&A^cfcetfistids of Teachers" Washington* D* 
American -Council on Education* 196Q, p.iiOGu. 



Teacher Behavior? 

Partial - Fair 
Autocratic - Democratic 
Aloof - Responsive 
Narrow - Broad 

A total of four pupil and 18 teacher characteristics were used, 
Biddle^* has proposed a seven variable model for determining teacher 
effectiveness, 

1. Formative experiences of the teacher 
2* Teacher properties 
3. Teacher behaviors 
k* Intermediate effects 

5. long term effects 

6. School and community contexts 

7. Classroom situations 

Approaches such as this have been studied in the past by Barr and 
3 

McCall * The model variables have a tendency to overlap in areas and, 
due to the great amount of interaction, this type of model has not been 
generally accepted by the profession. 

Some recent studies in classroom observation have used the approach 



^ J, B, Biddle and William J. Ellena, "Contemporary Research on 

Teacher Effectiveness" Chicago: Holt, Rinehart and Winston, 196k, pp.5~l8, 

2 

A, S. Barr, "Characteristic Differences in Teaching Performance of 
Good find Poor Teachers of the Social Studies" Bloomington, HI,: 

Public School Publishing Co,, 1929* 

^ William, A, McCall, "Measurement of Teacher Merit" Raleigh, N, C. * 

State Department of Education, 1962, 



..uh 



rtirWi i 



8 

X 2 

of studying classroom interaction. Hughes and Flanders have completed 
research in this approach* Flinders developed /an observation instrument with 
ten categories for observable interaction behaviors* Seven were descriptive 
of b<ihavio.:'s of teachers, two wore: of student behaviors, and one was for 
silence* Be distinguished between student and teacher patterns* All cat- 
egories wdve observable in terms of direct or indirect influence of the 
teacher „ He compared the patterns with student achievement and attitudes. 

Garrison^ working with the Stanford Secondary Education Project, 
developed a teacher appraisal guide basesd on the Getz el interaction model 
of social, psychological, and anthropological aspects of a class situation* 
Four steps of teaching were defined* Twenty-five behavioral statements 
were incorporated in the four steps; the final statement being an overall 
appraisal of the teacher. Garrison found significant teacher improvement 
when the observation appraisal instrument was used as an improvement of 
instruction device. The attitude of teachers towards observation was 
improved when used to improve their teaching rather than used as a rating 
device of the teacher. The reliability coefficients of observer agree- 
ment were consistently above *50* He concluded observers could agree on the 
level of performance and that his instrument was sensitive to levels of 
performance. 




Marie Hughes, “Patterns of Effective Teaching, Second Progress Bfeport, 
Merit Study” Provo, Utah: Provo City Schools, 1961. 

Ned Flanders, “Teacher Influence, Pupil Attitudes and Achievement, 
Studies in Interaction Analysis" Minneapolis? diversity of Minnesota, 
U. S. Office of Education Cooperative Research Project No. 397. Mimeo- 
graphed. 

Harry L. Garr ison, “Evaluation of Teaching and Learning” unpublished 
Ed.D. thesis, Stanford University, 1961* • 
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Medley and Mitzel^ describe other classification/3 and methods for 

direct observation and appraisal of classroom interaction# Included in 

these and the types of criteria investigated ares 
o 

Morsh * s s 

1. Instructor verbal behavior 

2# Instructor non-verbal behavior 

3 4 Student behavior 
Anderson ant Brewer 

1# Domination with evidence of conflict 

2. Domination with no evidence of conflict 

3# Integration with evidence of working together 

1*. Nervous habits of students 

Child domination of other children 

6* Non-conformance to teacher demands 

7# Social contributions by the child 
Mitzel and Medley 

1. Pupil supportive 

2, Problem structuring 

3* Directive 

— — — - — — — ^ ^ ■ 

■■ - L 

1 Donald M. Medley and Harold E. Mitzel, "Measuring Classroom Behavior by 
Systematic Observation." Handbook of Educational Research, ed. N. L. Gage 
Chicago: Rand McNally and Co., 1963, p. 2li?-329 

2 j # g, Horsh, Systematic Observation of Instructor Behavior. 05 AF Person- 
nel Training Research Center Development Report No. AFPTRC-TN-^6-^2 

3 h. H. Anderson and Helen M. Brewer, "Studies of Teachers* Classroom 
Personalities II: Effects of Teacher* s Bominative and Integrative Contacts 
cn Children* s Classroom Behavior," Applied Psychology Monograph, 19i*5, No. 6 

h Donald M. Medley and H. E. Mitzel, "Studies of Teacher Behavior: Refine- 
ment of Two techniques for Assessing Teacher *s Classroom Behaviors** New 
tfottc: Board of High Education, City of Netf York, Division of Teacher 
Education, Office of Research and Evaluation, 19f>£« 
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1*. Reproving 

i 

Jersild and others: 

lo Cooperating activities 
2. Experimental activities 
3* Critical activities 
It. Leadership activities 
5* Recitational activities 
6. Self -initiated activities 
7* Work spirit activities 

All of these studies have shown that the classroom observation approach 
to the appraisal of teaching is feasible. They have asked educators what 
should take place in a classroom and they have called upou educators to 
interpret or evaluate what thsy observe in a classroom. 

Another major assumption of this study is that it is both important and 
possible to assess changes in attitudes of beginning teachers. 

Getzeb and Jackson refer to the import &aoe of the teacher's attitude 
as a classroom variable by stating, "The educational impact of an Ichabod 
Crane or a Mark Hopkins, of a Mr. Chips or a Socrates, is surely not due 
solely to what he knows, or even to what he does, but in a very real sense 
to what he is."^ 

Studies of teacher attitudes include one by Callis (1950) who invest- 
igated the changes that occur during teacher training and early teaching 
experience. The measurement instrument used was an extension of the Lesds 



A. T. Jersild et al., "An Evaluation of Aspects of the Activity Program 
in the New York City Public Elementary Schools, n Journal of Experimental 
Education, VXII (April, 1939), 166-207. 

J. W. Geizels and' P. W. Jackson, "The Teacher* s Personality and Charact- 
eristics," Handbook of Research on Teaching, ed«, N. L* Gage Chicago: 
Rand McNally & Company, 1963, p. 506. 









,w, ¥" ^ < 






' , X ~ - 



'V - K . J ’ 



! 



I 












' - * - « 

trii^.'b ■» W * 






^^fXWlrtw^WMa^ati 






11 

inventory. The first ste smiths of professional training produced signifi- 
cant changes in the desired direction 3n 20 per cent of the attitudes (items) 
while the first six ?«c»ths of experience produced significant changes in 
11 per cent of lihe attitudes (items) in m undesirable direction*'*' 

An additional study of the attitude change of beginning teachers after 
initial teaching experience was reported! by Harry P. Day. The study utilised 
in part data collected by administering the Minnesota Teacher Attitude 
Inventory to a group of graduates and to a group of student teachers at 
Florida State University « This was administered to the two groups before* 
asst after an initial teaching experience. The graduates took the test 
initially upori their return fro® intemghip teaching in the public schools. 

A year later it was administered agais. those who had prepared but had not 
entered teachings There was a mean loss of 20*0 for those who had taught for 
one year compared with a mean loss of 1*5 for the non- teaching group* The 
group which was administered the HAXI before and after internship also showed 

a mean loss of h.2. In both groups there was a shift toward less desirable 

2 

attitudes after a teaching experience. 

Here than $ 0 other studies using attitude inventories have been sum- 

3 

marized by Qetzels and Jackson in the '’Handbook of Research on Teaching" • 

For this project the staff chose to use the sonant ic differential technique 
as reported by Osgood and his associates* As a by-product of their work in 
experimental semantics * Osgood; Suci and Tanneribaum developed a new approach 
and rationale for attitude measurement. Attitudes according to Osgood 



^Getzel and Jackson; op* cit*; p* $09 



2 Harry P. Day, "Attitude Changes of Beginning Teachers After Initial 
Teaching ibqperienoe, " Journal of Teacher Education X, September, 195?* 
pp* 326-328* 

^ Qetzels and Jackson, op* cit*, p« 508 
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"can be ascribed to erne basic bipolar continuum with a neutral or aero 
reference point, implying that they have both direction and intensity and 
providing a basis for the quantitative indexing of attitudes".'*’ The semantic 
differential technique for measuring meaning is essentially * combination of 
controlled association and scaling procedures * The subject is provided 
with a concept to be differentiated and a set of bipolar adjective scales 

against which to do it* His task is to indicate, for each item, the direction 

P 

and intensity on a seven-step scale*" 

Evaluation of teachers s behavior appears likely to be affected by a 
variety of variables « The research staff assumes that it is particularly 
important, in measuring teaching behavior, to attempt to control^ or 
compensate for, as many gross variables as possible* In the "Handbook for 
Research on Teaching”, Donald Campbell and Julian Stanley suggest a design 
which appears to the research staff of this proposal to be applicable to 
this problem* 

"In Design b, matching can be recognized as a useful adjunct to 
randomization but not as a substitute for its in terms of scores 
on the pretest or on related variables, the total population 
available for experimental purposes can be organized into carefully 
matched pairs of subjects; members of these pairs can then be 
assigned at random to the experimental or the control conditions* 

Such matching plus subsequent randomization usually produces an 
experimental design with greater precision than would randomiza- 
tion alone."’ 



Charles E. Osgood, George J. Suci, and Percy H* Tannehbaum, The Measurement 
of Meaning, Qrban&, Illinois t University of Illinois Press, 1957, 
pp. 189-191 

^ Ibid* p. 20* 

^ Donald T. Campbell and Julian C, Stanley, "Experimental and Quasi-Ezperi- 
mental Designs for Research on Teaching", ixt Hahdbofck for Research on 
Teaching, N* L. Gage, Editor, Chicago: Rand, McNally and Co*, 1963 
P. 219. 
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of beginning teachers daring their first year of teaching results in 
significant changes in. either the classroom behavior of these teachers 
and/or their attitudes toward the profession of teaching* 

A pilot study was carried out in the spring of 196$ with 23 begin- 
ning teachers of the Seattle School District • The purpose of this stu- 
dy was to test instruments for assessing attitudes and appraising dif- 
ferent classroom behaviors of beginning teachers* The pilot study ms 
also intended to field test procedures for collecting the essential data 
of the study* The staff felt that it was particularly important to as- 
sess procedures for training the observers who would visit beginning 
teachers to evaluate their behavior« This pilot study was to determine 
the specific instruments and the procedures to be used in collecting 
the data for the experiment which was to take place in the fall semes- 
ter of 196$* An additional purpose of the pilot study was to collect 
data about the beginning teachers and relate these data to judgments 
about their classroom performance* The purpose of this effort was to 
determine which characteristics of teachers might be inport ant ones to 
use as matching variables in the later experiment* 

Approximately 120 beginning elementary teachers were to be the popu- 
lation for the experiment* In order to eliminate many variables which 
the staff judged not related to the major questions of the study, the 



tes ters selected for the study were to be town from as homogeneous a 
population as practicable* The population was limited to elementary 
teachers who were women* graduates of teacher-education institutions in 
the State of Washington of the preceding; June* and who were assigned to 
schools of comparable socio-economic pop ulat ions a This socio-economic 
population served by the schools was middle or upper-middle class o 
After determining the teacher population from ^hich the exp er iiriont &X 
subjects were to be drawn* two characteristics of beginning teachers* 
determined by the pilot study* were to be used as matching variables* 
Matched teachers were then randomly assigned to four different treatment 
groups* 

The four groups were: 

!• Group I was to be released of approximately 25 percent of 
classroom teaching time* This released time was to be used 
by the teacher for class preparation* and also for conferen- 
ces with a supervisor who would observe the teachc *s per- 
formance during the weal';* In Group I each teacher was visit- 
ed by a particular supervisor twice a week* and conferred 
with the superivsor on the basis of this observation twice 
a week* The supervisors were trained in the techniques in- 
volved in using a teacher appraisal guide* This appraisal 
guide was the same one used to judge performances of of 
the teachers in the study* 

The research project funds were used by the school districts 
participating in the study to hire experienced substitute 
teachers who took over the classes of Group I teachers for 



15 

a quarter of each day during the period of the experiment. 

The same substitute teacher worked with the Individual Group I 
teacher throughout the entire period of the study* In effect; 
this was a miniature teaching team* 

2* Group II teachers were released from approximately 25 per 
cent of classroom teaching time in the same manner as were 
Group I teachers* The beginning teachers of Group n need 
this released time for preparation for their classes, and 
twice, a week visited an experienced teacher with a simi- 
lar assignment* These experienced teachers, observed by 
Group II subjects^ were selected by the beginning teachers' 
principals as being teachers expert in soma teaching ap- 
proach applicable to the classroom of the Group H teacher* 

3o Group m teachers were given approximately 25 per cent 
reduction in pupil load as were the first two groups* 

This reduction ia load was accomplished, however, by as- 
signing the Group HX teachers to classrooms with approxi- 
mately 25 per cent fever pupils than the average assign- 
ment of teachers in that school district for that grade 
lev&U No other special treatment was given the Group ttt 
teacher* 

2u Group IV teachers were Intended to be a 'control group* 

They were to receive no special treatment comparable to 
that of Groups I H, or XU* They received the orien- 
tation provided other beginning teachers not members of 




t he study, which was the practice in the five coop- 
erating school districts. They were visited by observers 
and an attitude test was administered them as was the 
case with the other three groups* It was honed that the 
visits by observers would be sufficiently different 
treatment for Group IV teachers that this would equalize 

the n Hawthome Effect" which might be experienced by the 

*♦ 

other teachers in the study. 

, Observers were trained to administer an appraisal check list cov- 
ering selected criteria for judging the performance of a classroom 
teacher. These observers were to visit each teacher in teams of three 
on four occasions* The first round of observations was to take place 
during the first four weeks of school in the fall. The judgements 
made on this round of observations were to be used as covarlates in 
analyses of covariance to «ssess changes in chassroom behavior of all 
teachers in the study during the experimental period* The observations 
were repeated at intervals of four and one-half weeks* The last of ths 
four observations was made in January just prior to the conclusion of the 
fall term. The observer teams were rotated so that each observer worked 
with any other observer as infrequently as possible, while still visit- 
ing any one teacher a minimum number of timeso For purposes of scheduling 
the observations, the total group of teachers were divided into three 
groups. The schedule for assigning the nine observers to the three 
groups of teachers for four observation rounds is shown in Table 1 
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TABLE 1 



SCHEDULE AND ROTATION OP TEACHERS ANii 
OBSERVERS IN TEACHER APPRAISAL STUD? 



Teacher 

Group 


Round 1 
Obs* Team 


Round 2 
Obs* Team 


Round 3 
Obs* Team 


Round k 
Obso Team 


1 


ABC 


ADG 


CEF 


BdX 


2 


DEP 


BEH 


LGI 


PAC 


3 


CM 


CPI 


ABH 


GDE 



On the occasion of each observation, each of the experimental 

* 

• 0 

suojects was given a questionnaire to indicate her attitudes towards 
teaching* Thus there were four administrations of the attitude scale 
during the study* 

The data for the study, then, were the judgements of three indepen- 
dent observers of approximately 120 beginning teachers on four differ- 
ent occasions, and the responses of the beginning teachers to the four 



different administrations of the attitude scale* These data were to 



be analyzed by computers to determine whether or not there were signi- 
ficant differences among the groups in terms of either classroom beha- 
vior or attitudes* 



The Pilot Study 

Hiring the spring semester of 1?6£, 28 beginning teachers in the 
Seattle School District were visited three times by a team of three 
trained observers* The observers were trained in the application of 
two scales for appraising classroom teaching* One was tho Ryans 1 
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Classroom Observation Record and the other was the Stanford Teacher 
Appraisal Competence Guides Copies of these instruments appear in 
Appendix A* The two forms were selected because members of the 
research staff had experience in previous studies with these forms, and 

fll flA hfinelioa aamI am 4. ^ xi n f . 
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The Ryans Scale seemed to be concerned with the characteristics of the « 
teacher, while the -Stanford Guide seemed to be focused upon the teaching 
act# Previous experience, with the Stanford form had been limited to 
secondary school teachers, and one of the concerns of the staff was 
whether or not the criteria of this Stanford form were applicable to 
elementary teaching. 

The 28 teachers who were the population for the pilot study were 
all in their first year of teaching* Approximately half of the group 
had begun teaching during the same semester as the pilot study# The 

t 

>c-her members of the group had begun at different times during the pre- 
ceding 'semester# They were chosen because they were the least experi- 
enced teachers in the Seattle School System* This lack of experience on 
the part of the pilot study subjects, was considered the most iuyortsnt 
factor in their selection# Because of the small number of available 
teachers with this limited experience in Seattle, the group possessed 
some characteristics which v are not common to the population for the 
major study in. the fall* For example, the teachers were both secondary 
and elementary teachers, rather than being limited to the elementary 
level* The teachers were also b£ both sexes, and some of them had been 
trained in institutions outside of the State of Washington* None of 
the teachers in this pilot study received any 'cf the experimental treat- 
ments designed for the major study* 



The observers trained for this pilot study were a fort&gr secondary 
teacher of agriculture; a teacher trained for junior high aXssasocms* 
but who was a substitute teacher in the Seattle District in tim inter- 
mediate grades; and a primary grads teacher who was a substitute tea- 
cher in the Seattle systems During the training period* the staff worked 
with other professional person# who they thought might be potential 
observers for the study* including supervisors of student ts&chers and 
consultants to othsr elementary teachers * In this limited experience* 
the staff decided that the substitute teachers recruited for this study 
were somewhat mors trainable than the exp^isaced supervisors* This 
belief was important to the judgements made later in attainting to re- 
cruit observers for the major study* 

Training included seminar sessions with Br* Harry L» Garrison* a 
Research Associate with the project* whose special field of competency 
was training of Seattle school personnel in the appraisal of classroom 
behavior* Dr. Garrison* s previous research included the development 
of the Stanford Appraisal Guide* His major responsibility on this 
project was to train and supervise the teams of observers who visited the 
classrooms of the beginning teachers in the study* The process of 
tra inin g the observers included visits to classrooms of experienced* 
able Seattle teachers and to the classrooms of cadet teachers* Obser- 
vers* in this way* were able to study a range of behavior for the dif- 
ferent criteria used in the rating forms* A need that arose in the 
study was to develop materials for training observers which would pro- 
vide for repeated study of exactly the same teacher behavior* TO meet 
this need* the research staff developed video-taped recordings of tea- 
chers in the Pullman* Washington schools* These tapes were used 
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y^sat&fty throughout the project as training materials for observers* 

Baring this pilot study each of the 28 beginning teachers was vis- 
ited fey this team of thraa aK O SVW fAMfi An 4 * UnAA jS4 

’ * — w '-' »• vwvv* twhw VM VUAOO v^va*" 

sioao® .die visits were four weeks apart and the two instruments under 

8tu<^r wars rotated equally among the observers and among the 28 class- 
rooms* 

A trial form of an attitude test waJ developed and tested with the 
28 pilot study teachers® The attitude test was based upon the ssjiantie 
differential concept of Osgood and others® All of the concepts, which 
in the staffs judgement were plausible objects of beginning teacher 
opinions or attitudes, ware Included in this trial instrument® Also, pairs 
of adjectives were included from the Osgood material which seemed appli- 
cable to these concepts in this context* litis rather lengthy instrument 
was to be given each of the 28 subjects on each of the three occasions 
when the observers visited their classrooms® A copy of the trial f data of 
the attitude scale, appears in Appendix A in underline® 

The 28 teachers for the study root just before the first round of 
observations with the research staff, who explained to them the purposes 
and procedures of this pilot program® On that occasion each of the 28 
teachers was given a questionnaire to fill out, furnishing the staff 
with information about each of the subject's personal and professional 
backgrounds. A copy of this questionnaire is shown in Appendix A* The 
purpose of thie information, as was explained above, was to collect data 
which might be related to the classroom behavior of beginning, tsaohsre® 
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It hoped by the staff that these analyses vould indicate which tm 
factors about the beginning teacher would be important as matching variables 
in the fall study* 

A diary was kept of the experiences of the observers during training 
and during the actual observations* This anecdotal record was meant to 
serve as a guide to the staff in the later conduct of the actual expert- 



The data from the pilot study consisted then of judgements by three 
trained observers, based upon two different rating scales and made on three 
different occasions, as to the classroom behavior of 28 beginning Seattle 
teachers* The data also consisted of these 26 teachers 3 responses to the 
trial form of a Semantic Differential Test of attitudes towards teaching* 

Results of the Pilot Study 

Both of the instruments used to appraise classroom performance of the 
28 teachers in the pilot study were judged administratively efficient and 
capable of high reliability* The coefficients of correlation between 
pairs of observers using the Ryane Scale and the Stanford Form appear in 
Tables 2 and 3* 

The independent judgments of the three observers, whan compared on 
tha different criteria of each instrument, were gratifyingly highly related* 
It appeared to the rSSSs^^%toXt that^tfe first two basic assumptions of 
the study were upheld by this pilot study experience* 

The data concerning observer ratings on the two instruments did not 
indicate a clear preference for one of the two instruments* Tha three 
observers were unanimous in their opinions, however, that the Stanford 
appraisal guide was the simpler to administer and the more logical to 



OBSERVES 00RHSUTX0N5 Q IN THE PHOT 
suns FOR THE STANFORD FORM 



Observers 



Round 





1 


2 


3 


1*2 


.85 


.81 


.89 




1* 3 


.75 


.61 


.95 




2*3 


.77 


•79 


.91 





Pearson product sxsmant correlations e 



TABLE 3 

OBSERVER CORRELATIONS® IN THE PILOT 
STOIS FOR THE RIANS FORK 



Observers 



1& 2 
1 & 3 
23e3 





Round 




1 


2 


3 


.7? 


.72 


.93 


•58 


.75 


.77 


CM 

CO 

e» 


.90 


•yO 



^Pearson product ament correlations* 




use* Their feelings were that viewing the teaching act* was apt to lead to 
a more objective judgement thm studying the characteristics of the teacher, 
which was ths focus of the Byens scale* 

On the basis of the observers 1 judgements and partly due to the fact 
that the Besearch Associate, Dr* Harry L* Garrison, had extensive experience 
with one of the scales, the Stanford appraisal guide was chosen as the basic 

instrument for classroom observations* 

% 

During the summer of 196$, Dr* Garrison, working with the observers 
from the pilot study, modified the Stanford scale somewhat to make it easier 
to administer and likely to be more consistently applied by different obser- 
vers# The resulting modification of the Stanford guide is called the 
Seattle Teaching Performance Appraisal Guide* This became the basic instru- 
ment for assessing the behavior of the experimental subjects in the study* 

A copy 'of the Seattle Teaching Performance Appraisal Guide appears in 
Appendix A* 

The experience of the staff in training the observers for the pilot 
study led to some new practices for preparing the observers who warked with 
the actual experiment. One unexpected finding was that observers were found 
to naed repeated training throughout the study* The opinion of Dr* Garrison 
and the research staff was that the appraisal technique required the obser- 
ver to develop some mental models of different kinds of classroom behavior 
of teachers* As the observer applied these models in appraising specific 
teaching acts, subtle changes in these mental models developed; and, with 
time, the criteria used by a particular observer became less and leas the 
criteria with which he started the observation ax^rlence* Thus it seemed 
to the staff that repeated training, preferably with video-taped record- 
ings, was necessary to maintain a consistent application of the instrument* 





In the judgements of the observers and Or* Garrison, the Stanford 
appraisal guide, as revised in the form of the Seattle appraisal guide, 
clearly was applicable to the elementary school classroom as well as to the 
secondary classroom* Because this form is based upon a model of a couplets 

-Q _ j i. j l j a xt l. i x * i Li _ n 
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vers see the teacher at the point where classes begin in the morning or 
after an intermission during the school day* The observers found, however, 
that all steps in the teaching act, . as developed in the Stanford form, could 
bo viewed in a typical classroom of an elementary teacher in approximately 
one hour* 

The semantic differential trial form designed to test attitudes was 
administered three times to the 28 subjects in the pilot study* The 
responses of the subjects were tabulated for each concept and for each pair 
of adjectives designed to test attitudes regarding that concept* Table 25 
in Appendix B shows the mean ratings of the 28 subjects and standard devi- 
ations of these moans for each pair of adjectives, for each conoept in the 
trl& form* 

It was the staff's judgement that the trial fom of the attitude scale 
was too long* The beginning teachers found it tedious to complete and thus 
tended to mark the responses without sufficient care* In the interests of 
shortening and improving the scale the staff eliminated what seemed to be 
th© least promising concepts and pairs of adjectives* Through inspection, 
the staff eliminated concepts on i&ich responses tends' i group near 4*0, 
and for which standard deviations were small* These s of patterns of 
responses suggested that attitude intensity was low* (h*0 is "neutral 11 
on the seven point scale*) The staff also eliminated those items where 
msafts of responses were near 1*0 and where standard deviations were small* 



These patterns suggested that there was little range of response, and vari- 
ations among groups In the experiment would be small* Also, some adjective 
pairs consistently appeared to lack discrimination power, and these were 
eliminated. The staff also decided to use only concepts for which at least 
six adjective pairs appeared to be promising discriminator?. One of the 
concepts did not meet these subjective criteria and was still kept in the 
final form of the scale* This concept was “teaching as a career* n The 
staff left this one in the final form because it was most closely related 
to the principle concern of the project in this phase of the study* The 
revised form of the Semantic Differential Attitude Scale appears in 



TABLE k 

PRODUCT mwm CORRELATIONS OF PILOT STUD! SUBJECTS 9 GRACE 
POINT WITH OBSERVER RATINGS OF THECR TEACHING 

(N-26) 



Grade Point Averages 



Correlation Coefficients With: 

Ratings on Ratings on 

Stanford Form Ryans Form 



Cumulative Grade Point 

average in college *06 -.68 

GPA in Education Courses *20 *19 



TABLE $ 

MEAN TOTAL SCORES ON TEACHING PERFORMANCE BT PILOT STUDY 
SUBJECTS ACCORDING TO COLLEGE ACADEMIC MAJOR 



Major 


N 


Mean of Total Scores 
on Stanford Form 


Mean of Total Scores 
on Ryans Form 


Social Studies 


12 


107.75 


212.25 


Languages Arts 


6 


109.16 


22lt. 


Other 


8 


302.75 


224.50 
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t able 6 

teaching msommBB op phot stubt m&ms compared to 

GRADES IK ST&S3NT TEACHING 





At or Below Mean 
of Ratings on the 
oyane scale 


Above Mean 
of Ratings on 
Ryans scale 


"A" in Student Teaching 


7 


7 


"B 11 in Student Teaching 


6 


6 


Chi Square e 0 




At or Below Mean 
Ratings on the 
Stanford Scale 


Above the Mean 
Rating on the 

Stanford Scale 


"A n in Student Teaching 


6 


8 


"B" in Student Teaching 7 5 

Chi Square for 2x2 fold table, corrected for continuity® * 1.39 (n.s) 



a Hotiry /So Gahrstt, "Statistics In Psychology and Education 3 , David 
McKay Co., Inc. 1962, p. 26$. 



TABLE 7 

TEACHING PERFORMANCES OF PHOT STUM SUBJECTS COMPARED 
TO GRADE LEVEL ASSIGNMENT 





Below Mean of Ryans 
Seals Ratings 


Above Mean of Ryaa& 
Scale Ratings 


Primary 


7 


7 


Intermediate 


3 


$ 


Chi Square for 2x2 fold table, corrected continuity* * .87 (n.s.) 




Below Mean of Stanford 
Seale Ratings 


Above Mean of Stanford 
Scale Ratings 


Primary 


? 


7 


Intermediate 




h 


Chi Square • 0 







a Garrett, op. cit. 




Table 2% frppm&ix B, shows a sussary of the data teacher behav- 
ior a© related to general characteristics of the beginning teachers in the 
pilot study» Wo characteristics were found to be significantly associated 
'With the level of teaching performance as determined by observer ratings* 
The design of the project called for a matching procedure in which dus- 
ters of four subjects would be matched according to two characteristics* 

As the data in Table 2h do not suggest two variables, the staff used their 
own judgements and those of members of the advisory committee to the pro- 
ject to make this selection* Grade in student teaching and primary or 
interabdiate grade-lev&L assignment were chosen because they were each two- 
factor variables, and would be staple to apply in matching* Also, the 
commtttao and staff judged them to be likely to be related to teaching per- 
formance in a larger sanple than that of the pilot study* 

Tables k-7 show comparisons pilot study subjects 1 characteristics 
with teaching performances* 

One comparison, college grade point with Ryans scale ratings, appeared 
to be negatively significant* Because the other three comparisons with 
grade point approached zero correlation, the staff judged this significant 
correlation to be due to chance* Final ratings of teachers in the fall 
experiment were correlated with grade point, however, to check on this 
judgement* 

The results of the pilot study then were: 

1* The appraisals by trained observers of the classroom perfor- 
mances of teachers were judged to be administratively feasible 

and sufficiently reliable for purposes of this research* 

% 

2* The Stanford Appraisal Guide, as adapted, was jud^d easier 
to administer than the Ryans Scale, and more likely to be 
applied consistently by different observers* 

3* The factors which were selected to be used in a matching pro- 
cedure for selecting experimental subjects were grade level 
assignment and student teaoher grade* 



k« Thorough and repeated training of observers was judged essen- 
tial for maintaining reliability in the us© of the appraisal 
instruments* 

£• The Semantic Differential Attitude Scale was revised and short- 
ened according to the relative discriminating power of adjective 
pairs and concepts which were responded to by 28 beginning teachers* 

Procedures in the Major Experiment 

While the pilot study was being undertaken, the research staff mst with 
school districts in the Seattle metropolitan area to arrange the conditions 
which wsre necessary for the experiment* Five of the largest school dis- 
tricts in the State of Washington agreed to participate in the study* These 
districts were the Seattle School District, Shoreline School District, 

Edmonds School District, Bellevue School District, and Lake Washington School 
District* To limit the factor of varying school district populations, 
schools in the central area of Seattle were not used for the study* Those 
school populations in Seattle which were used, and the four other districts, 
comprise outlying residential areas in metropolitan Seattle* It was not 
feasible to control the socio-economic backgrounds of the pupils in the study 
to a greater degree* The cooperation of these five districts was a major 
factor in the successful completion of this research* Indirect costs borne 
by these districts were a substantial contribution to the financing of the 
project* 

Hiring the summer of 196$ Dr* Harry Garrison worked with nine experi- 
enced teachers who were recruited to serve as observers for the study* Tbs 
summer training psriod consisted of seminar discussions and practice in 
applying the Seattle appraisal guide to video-type recorded performances 
of classroom teachers* 

Hiring the same period of time the staff obtained the names of all 
beginning elementary teachers who had bean hired by the five partied- 



patiag districts© Data on these teachers were also collected which 
included* the college from which the teacher graduated, the student 
teaching grade, the college grade point average, the school district's 
assignment of the new teacher, the college major study, and the mari- 
tal status of this teacher© These names became the potential popula- 
tion from which the subjects of the study were selected© 

Near the end of the sucsaer of 196$, Ur* Garrison worked with su- 
pervisors from the five participating districts who were involved in 
the treatment of the Group 1 experimental subjects© These supervisors 
were to observe each beginning teacher in Group X for a period of ap- 
proximately an hour and one-half tuice during each week, using the 
Seattle appraisal guide, and then here to confer with the beginning 
teacher during that teacher's released time© The conference was to be 
based upon the supervisor's observation of the beginning teacher© The 
supervisors involved in this treatment were given similar training to 
that given the observers© 

The research staff met with principals and other administrative 
staff members in each of the five participating school districts to 
explain the project and to indicate the nature of the support that was 
expected of the principal© Principals of teachers in Group H were asked 

w 

to select experienced, able teachers which would be appropriate for 
the experimental subjects in Group H to visit during their released 
time© 



Xn late August, the staff worked with the administrators of the 
five school districts to make it possible to juggle class enrollments 




80 that Group III teachers would have tha 2$ per cent smaller class 
size which was required by the design of the study* 

Just prior to the beginning of school the research staff selected 
the subjects from the population of new elementary teachers* The sub- 
jects selected met these criteria: They were women; they were June 
graduates of Washington teacher* education institution; this was their 
first teaching experience; and their assignment was limited to an ele- 
mentary school with grades kindergarten through six* From this group* 
clusters of four teachers were matched on the bases of their respective 
grades in student teaching ( W A B or W B W )* a nd their grade leve l assign- 
ments (primary or intermediate)* These clusters of four matched tea- 
chers were then randomly assigned to each of the four treatment groups* 

The research staff then met with the selected beginning teachers 
in each of the five participating districts* At these meetings the 
staff explained the purposes of the study and the nature of each group’s 
treatment* It was decided to be completely candid with the experimen- 
tal subjects rather than to attempt to conceal the treatmsnt of groups 
other than the ones in which the;' were participating* The advice of 
school administrators was that this could not bs kept secret at c?y 
rate* and would probably be less threatening if the teachers knew the 
Information they would find out anyway at the earliest possible time* 

The beginning teachers selected for the study were given tbs alterna- 
tive of re m ai nin g in the study or dropping out* A tm of the selected 
subjects asked to be dropped from the study* Last minute enrollment pro- 
blems in the cooperating school districts made It impossible for some 
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other selected teachers to participate* The fine! number of subjects 
at the beginning of the study in the fall m3 105* 

In each school district, substitute teachers were recruited to re- 
place Group I and Group n teachers during one-quarter of each school 
day* Typically, the substitute teacher relieved the Group I and Group 
II teacher for a period of time, either after lunch until the after- 
noon break in the school day, or from that point until the end of the 
school day* The same substitute teacher remained with each Group I or 
H teacher throughout the period of the experiment* Substitute teachers 
did net take up their duties, however, until after the Group I and II 
teachers had been visited the first tins by teams of trained observers* 

Group HI teachers were assigned to classrooms which the adminis- 
tration had *een able to limit to the smaller enrollment required under 
the design of the study* Group IV teachers did not receive any of the 
treatments planned for this experiment, but did participate in the regu- 
lar orientation procedures held by the fiva districts for all their be- 
ginning teachers* 

A final period of training was given the nlns observers who would 
visit the beginning* subjects* This final training included visits to 
the classrooms of experienced Seattle teachers, and to classrooms whe.*' 
cadet teachers were doing their practice teaching* These observers also 
reviewed video tapes of experienced teachers* The first round of ob- 
servations began one week after the fall tarsi opened in the five dis- 
tricts and continued for a two and one-half week period* Hiring this 
round of observations, as in subsequent rounds, each classroom tsaoher 
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in the four groups was visited by a team of three observers* The obser- 
vers made their judgements independent of each other, and recorded those 
judgements on the Seattle appraisal guide while they Here actually in 
the classroom* 

Prior to this visit eaoh teacher had received a letter informing 
them of the intending visit, and also asking them to fill out an enclos- 
ed copy of the Semantic Differantial Attitude Scale* Upon leaving the 
classroom teacher after the observation, the observers picked up the 
completed Semantic Differential Attitude Scale* 

Between tbs first and second round of observations the nine obser- 
vers vent through a brief period of retraining, and were reassigned to 
different teachers and as members of different teams of observers* The 
plan of the project was to rotate observers so that there would be a 
minimum number of times whan any given observer worked nith the same 
observer or saw the same beginning teacher* (See Table 1} 

After this first round of observations, experimental treatments 
for Groups I and H subjects were begun* Group in teachers had received 
the experimental treatment from the opening of school* This was neces- 
sary because there was no administrative method for reassigning pupils 
after school started to meet the experimental conditions of a smaller 
class sias as required in the &«mdy* Thus a limitation in the study was 
that the Group HI teachers received from two to four weeks more treat- 
ment than did the Group I and II teachers* 



Observers visited the beginning teachers at one of three times? 
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at the beginning of school in the morning, after classes reconvened 
following the morning recess, or at the beginning of classes after 
lunch* The team remained with the classroom teacher until they had 
observed, to each observer^ satisfaction, the entire teaching act which 
was the model for the Seattle appraisal form. If a particular teaching 
behavior could not be observed by an observer, ha was instructed to mark 
that behavior aero. 

Four and one-half weeks after the first round of observations, the 
teams made a second round and also administered the Semantic Differen- 
tial Attitude Test a second time. This procedure was repeated, always 
with additional training of the observers, a third and a fourth tim9. 
Final observations were completed in January, just before the end of 
the fell school term. The schedule of observations was made so that 
the interval between observations was the same for each beginning tea- 
cher in the study. During the fall term, because of illness or other rea- 
sons, the experimental group was reduced to 96. TLir rather large 
loss of subjects was due partly to the fact that subjects ware matched 
in clusters of four. When one subject was lost, data for all four 
matched subjects in the cluster had to be dropped from the final 
analysis. 

The data collected from the experimental phase of the project in- 
cluded observations by three different teams of trained observers of 
beginning teachers who were &iven four different treatments. These obser- 
vation data were ratings of ten different teaching behaviors appraised by 
the t br©e observers on four different administrations. The data also 
Included four different administrations of an attitude scale to these 
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beg inn ing teachers. Thae© data t?sre then analysed by cesprters In fcfee 
Research Oifieo of the State Sug^rintendent of Public Instruction. 
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CHAPTER V 



ANALYSIS AND DISCUSSION OF THE DATA 
Introduction 

The main effect to be studied was the 'experimental treatments of three 
groups of matched beginning teachers compared to a fourth matched group on 
the basis of theta classroom performances* The data collected for this 
analysis were the judgements of observer teams as to ten different teaching 
behaviors of these subjects* These judgements were made four times at 
approximately one-month intervals* 

The first step in the analysis of the main effect of the study was an 
assessment of the reliability of the observers 1 ratings* The independent 
judgements of observers who rated the same teacher were compared as a test 
of reliability. To see if the ten different teaching behaviors rated were 
in fact different, the combined rating^ of observers for each behavior were 
correlated with each of the other behaviors* 

Then the observer ratings of the four treatment groups were analyzed to 
see if there were differences in teaching performances among the groups* 
These analyses of group differences were made for each of the four rounds of 
observer ratings* 

A secondary effect of the study was to be the apparent effects of treat- 
ments upon the attitudes of the beginning teachers* Teachers in the four 
treatment groups responded to an attitude scale based upon the semantic 
differential technique as reported by Osgood and revised on the basis of 
pilot study results* The attitude scale was administered four times at 
monthly intervals* The responses of the four groups were studied to deter- 
mine If there were differences among the four groups in attitudes towards 
different concepts associated with teaching* 



The total group's ratings on the four observation rounds were coupared 
Tilth grade point averages in college and, separately, with grade point in 
Education courses* 

Analysis of the Observation Rating Procedures 
Teams of three observers visited teachers In each of the four treatment 
groups. The observers made their judgements of ten different teaching 
behaviors independently. There were a total of nine observers* Xn the four 
rounds of observations, observer assignments were rotated so that a different 
team judged each teacher on each of the four rounds* Table 8 shows correla- 
tion coefficients of the three observers' ratings in each team for each of 
the four rounds* In order to conpare three observer scores, Pearson pro- 
duct moment coefficients, which were obtained for each pair of observers, 
were converted to Fisher Z scores. 1 Product moment correlations between 
pairs of observers are reported in Appendix Bo 

As reported In Table 8, correlations of observer judgements ranged from 
•29 to *88* The total correlation of all observers on each round varied 
from *38 to *82* The degree to which observers judged teachers the same way 
was less than reported in the pilot study* This lesser amount of agreement 
may have been due to the greater number of observers (nine compared to 
three), or to the changes in the evaluation instrument* The instrument 

1 Henry F* Garrett, "Statistics in Psychology and Education*!, New York* 

David McKay Co*, Inc*, 1962, pages 13b, 172* 
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TAB IS 8 



OBSERVER CORRELATIONS! TSUI BX ROUND SQC BEHAVIOR 



Oboo* ¥ U& 






* 


Behaviors 








Team 


X 


2 


3 


It S 6 7 


8 


2 


10 Overall 



Bound 1 



1 


•1(0 


..45 


.52 


.52 


.35 


.52 


•1(2 


.Sit 


.1(2 


.22 


fc 


2 


.52 


.55 


.66 


.63 


.61 


.65 


,65 


.56 


.32 


.1(6 


.57 


3 


.53 


.67 


.58 


.63 


.lit 


.52 


.61 


.55 


•65 


.57 


.52 












Bound 2 


0 • 










1 


.58 


.75 


.68 


.52 


•7k 


.67 


.66 


•62 


.72 


.72 


.67 


2 


.69 


.74 


.75 


.75 


.62 


,76 


.78 


.71 


.61 


.56 


.72 


3 


>2t) 


.21; 


•2k 


.20 


.32 


.31 


.22 


.22 


.31 


.23 


.27 



Round 3 



1 


>80 


.87 


.78 


,81 


•8k 


•88 


.80 


.81 


.85 


•81 


.82 


2 


>28 


•28 


.36 


.35 


.53 


.52 


.53 


.22 


.22 


.33 


.38 


3 


>62( 


.60 


.5!t 


.58 


.61 


.58 


.67 


.60 


•$k 


.58 


.52 




- 








Bound 1$ 












1 


.29 


.1(2 


.1(5 


.38 


.63 


.61 


.1(2 


.ilO 


.ItO 


.1(7 


•1(7 


2 


.76 


.72 


.77 


.77 


.75 


.77 


.76 


.75 


.80 


.78 


.77 


3 


.81 


.72 


.86 


.81 


^36 


•86 


M 


08 l 


.83 


.80 


•8? 



a Correlations are from inverted Fisher Z seoros of Pearson r's 
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used in the ©apartment was adapted from the Stanford scale used in the pilot 
study* 4 third possible source of variation might be the Interaction of 
observers and the instrument* Inspection of the data on observer agreement 
revealed that two of the nine observers accounted for most of the variations 
in observers* Judgements © When the ratings in which the two observers par- 
ticipated are omitted (Table $) the remaining coefficients of correlation, 
especially on the fourth round, are similar to those obtained in the pilot 
study* 

In spite of the relatively low degree of observer agreement, calcula- 
tions based upon the Spearman-Brown reliability formula 1 produced coeffi- 

V 

dents sufficiently high in the judgement of the project staff to warrant 
further treatment of the observer data* (See Table IQ) 

Inter-item oorraiations were computed for pairs- of the taa behaviors. 
These correlations were calculated for each of the four rounds of observa- 
tions* The coefficients of correlation between teaching performances based 
upon mean ratings by observers in Round h are reported in Table 11, The 
very high correlations obtained suggest that either different behaviors 
are closely related in actual practice or observers tended to be influenced 
by some behaviors when rating others* 

The evaluation instrument was conceived of as ten behaviors constitu- 

. v , 

ting four steps in a teaching cycle* These steps were (a) combinations of 
behaviors 1 and 2, (b) behaviors 3, h and 5, (c) behaviors 6,' 7 and 8, and 
(d) behaviors 9 and 10* Inspection of the correlations between behaviors 
suggests, however, that there is little justification for seeing these com- 
binations as distinct steps in the teaching process, and considerable 

^Garrett, page 3li3* 
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TABLE 9 



CORREEiATIOHS OBTAINED WHEN TWO OBSStVERS* 
RATINGS VERB (RUTTED 




1*0 

TAB IE 10 



o 

ERIC 



Observer 



RELIABILITY OF OBSERVER APPRAISALS BY TEA?! 
BY ROUND BY BEHAVIOR 



jjenavior 



Team 


1 


2 


3 




5 


6 


7 


8 


9 


10 


Overall 












Round 1 












1 


•67 


.71 


.77 


.77 


.62 


.77 


•7b 


.78 


.71* 


.55- 


.72 


2 


.77 


.79 


.85 


•81* 


.82 


.85 


.85 


.79 


•66 


.72 


.80 


3 


•81* 


.86 


.81 


.81* 


.78 


.81 


.82 


.79 


.85 


.80 


.81 


] l,r ' T " r “"' " 










Round 


2 












1 


.79 


.90 


.86 


•61 


.90 


.86 


.85 


.81* 


.88 


.88 


•86 


2 

; 


.87 


.90 


.90 


.90 


•81* 


.87 


.85 


.88 


.82 


.79 


.88 


3 


.1# 


•li9 


.1*9 


•1*3 


.59 


.57 


.55 


.55 


.57 


.1*7 


.53 


\ 

Round 3 


1 1 


.92 


.95 


.92 


.93 


.91* 


.96 


•92 


.93 


.95 


.93 


.93 


1 2 


.51* 


•5lt 


.63 


.62 


.77 


.77 


.77 


.55 


.55 


.60 


.6b 


! 3 

1 


.81* 


.82 


.78 


.81 


.82 


.81 


.86 


.82 


.78 


.81 


.81 


{ Round 1* 

I 


! 

h \ 


.55 


•71* 


.71 


.65 


.81* 


.82 


.7b 


.67 


.67 


♦73 


.73 


< 2 


.67 


.92 


.91 


.91 


.90 


.91 


.91 


.90 


.92 


.92 


.91 


3 


.93 


.92 


.91* 


.93 


.91* 


•91* 


.9b 


.93 


.93 


.92 


.93 
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Justification for interpreting ell tha babavic rs or dceely related aspects 
of a general teaching performance* 

Observers were instructed to score zero for a teaching behavior which 
they felt they were unable to observe* One test of the usefulness of the 
instruir*3nts was believed to be the percentage of zero scores* Only 18 of 
1260 behaviors, 1*2*2 per cent, were scored zero. These zero ratings were 
Judged to be an insignificant limitation to the usefulness of the instrument* 

Finally, each of the nine observers and two members of the research 
staff, who were occasionally used as substitute observers (one of whom was 
an observer in the pilot study), were questionnaired as to their opinions 
about various aspects of the instrument* A summery of their responses to 
the questions which were asked appears in Table 12* These tabulations 
show that the observers were nearly unanimous in their belief that the 
instrument was a fair and valid basis for evaluating teaching, and that 
each teaching performance represented a distinct behavior which they felt 
they had no difficulty in isolating during tleir appraisals of the begin- 
ning teachers* 

Analysis of the observers 1 ratings and a questionnaire to the obser- 
vers resulted in the following Judgements by the project staff concerning 
the observation procedure: 

1* Observer agreement was less than anticipated, but was Judged to 
be adequately reliable for purposes of further analysis of the 
data* 

2* The relatively low agreement among observers probably produced 
considerable variability of the ratings within treatment groups. 
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TABLE 12 

OBSERVER EVALUATION OF TEACHING BEHAVIORS 
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3. High correlations between mean scores of different teaching behav- 
iors suggested that the different scores should be treated as 
different approaches to rating one general complex of teaching 
performances. 

iu The observers supported the evaluation instrument and the proce- 
dures for appraising teachers 9 This was judged to be supporting 
evidence that the procedures were valid. 

Analysis of Classroom Performance 

The observer teams made their first visits to the teachers in the four 
different treatment groups beginning with the second week of school and 
continuing through the fourth week* The mean ratings of these observer 
teats for each of the ten teaching behaviors are reported for each group 
in Table £> The same table also reports the F ratios in an analysis of 
variance. No differences which were significant were found among the 
groups fcr any of the ten behaviors. Except for behavior number 1, the F 
ratios, in fact, approached zero* Apparently the etching procedures and 
selection techniques were effective in equating the four groijps at the 
outset of the study. Inspection of the mean scores shows that the average 
levels of performance of the subjects were somewhat below the average to be 
expected of a theoretical teaching population universe, i.e» luQ on a seven 
point scale. 

The design of the experiment called for a procedure of adjusting such 

differences as would appear in this initial round of observation through 

an analysis of covariance. In the covariance analysis the means on the 

initial round of observations were to be used as the control variable in 

the subsequent analysis ef differences among groups on these later observer 

ratings. This analysis of covariance followed six steps suggested by 
Lindquist. 1 
1 4 

* E. F. Lindquist "Design and Analysis of EKperimenia in Psychology and 
Education", Houghton Mffli \ Co., Boston 1936, pages 332-333. 
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•tabu 13 

imN PERFORMANCE EATINGS OP' THE FOUR TBELWM2 
GROUPS ON FIRST BOUND OF OBSERVATION 





Teaching 

Performances 




Treatment Group Means 




F • * 




Group 1 


Group 2 


Group 3 


Group 1* 


Ratios 


1. 


Suitability of Goals 


2.76 


3.13 


3.00 


2.51* 


• 

-4 

CD 


2« 


Acceptance of Goals 


2.55 


2.63 


2.80 


2.61 


•07 


3# 


Bqalanation of Resources 


2.65 


2.90 


3.01* 


2.78 


•S3 


U# 


Selecting the Plan 


2.5? 


2.7o 


2.81 


2.59 


•13 


5. 


Organising the Class 


2.7U 


2.72 


2.82 


2.96 


•12 


6© 


Classroom Control 


2.72 


2.89 


2.89 


2.85 


•07 


7. 


Classroom Climate 


2.61 


2.70 


2.87 


2.80 


•12 


8. 


Active Participation 


3.01* 


2.89 


3.09 


2.93 


.0$ 


9o 


Measuring Achievement 


2.72 


2.92 


2.81 


2.1*8 


•Ob 


10© 


Going Measurement 


2.70 


2.82 


2.70 


2.1*7 


.23 


11. 


Performances 1 and 2 


2.70 


2.88 


2.90 


2.57 


• 6 


12© 


Performances 3, k and 5 


2.65 


2.79 


2.89 


2.78 


•op 


13. 


Performances 6 9 7 and 8 


2.79 


2.83 


2.95 


2.86 


•OJ 


lb. 


Performances 9 and 10 


2.72 


2.87 


2.76 


2.1*8 


.30 


15. 


Performances 1-10 


2.72 


2»5b 


2.88 


2.70 
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Observers were re-assigned to different team® and went through a 
period of retraining before making a second round of visits to the beginning 
teachers. This second round took place four and a half weeks aft®** the first 
round. The ratings of observers for each group and for each teaching be- 
havior are reported in Table lb* Observers judged the entire population 

consisting of all four groups to be inferior in performances when compared 

* 

to the initial round of observations. On the kO mean ratings, 38 are lower 
than the mean ratings made in the first round. Inspection of these data 
suggests that there are no differences among treatment groups in the patterns 
of change. 

The analysis of covariance is shown in Table 1$. As expected from the 
inspection of data, no significant differences were found among the four 
groups for any of the ten behaviors. 

A third round of observations was made following the same procedures 
which were carried out in the second round. Observer teams* judgements 
are reported for each of the ten behaviors in Table 26. Inspection of 
this table shows a different tendency for the mean scores to vary than was 
found in the second round. Groups I and II show consistent improvement, 
while Groups III and IV remain at relatively low levels of teaching perform- 
ance. All of the mean ratings for Groups I and II show gains over their 
observer ratings on the second round. In the case of Group I, nine of the 
oen behaviors are rated higher than they were rated on the first round. 

In the ease of Group II, s^ven of the ten behaviors were superior to 
round one. This consistent pattern of changes was not considered to be 
explainable in terms of pure chance. 

The analysis of covariance is shown in Table 1?» No significant 
differences were found among the four groups for any of the ten behaviors* 




MEaN SCORES OF FOUR TREATMENT GROUPS ON OBSERVATION RCUNDS 1 - AND 2 
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TABES 1$ 

ANALYSIS OF COVARIANCE: SKSOFICANCB OF DIFFERENCES OF MEAN SCORES AMONG 
THE FOUR GROUPS, EACH OF 18 SUBJECTS, OBTAINED ON THE SECOND 
ROUND OF OBSERVATIONS (ROUND 1 OBSERVATION RATINGS 

USED AS COVARIABLE) 



Soapce 


Sums of Squares Sums of Cross- 

for Round ? .* Products- 


Sums of Squares 
for Round 1 


4° 


1» Teaching Performances 


Suitability of Goals 






Treatments (T) 
Subjects (S) 
lx§ (Erroru E) 


1.35 

28.57 

95.63 


2.17 

17.10 

21.5b 


3.72 

15.90 ' 
8lo£l 


3 

17 

51 




F ratio 


• .063 (n.a.) 






2* Teaching Performance: 


Student Acceptance of Goals 




Treatments (T) 
Subjects (S) 

T x S (Error: E) 


3.79 

38.37 

93.98 


• .01 
1.61 
- 2.10 


.38 

56.b9 

95.b2 


3 

17 

51 




F ratio 


• 0670 (n» 8 ») 






3* Teaching Performance* 


Use df Resources 






Treatments (T) 
Subjects (S) 

T x S (Error: E) 


3.07 

29.85 

103.77 


.1*0 

1.1)3 

.70 


1.51 

b9.19 

109.52 


3 

17 

51 




F ratio 


* mh $ 3 (n.s*) 






km Teaching Performance: 


Selecting the Plan 






Treatments (T) 
Subjects (S) 

I x S (Error: E) 


1.27 

33.86 

97.13 


.57 

16.13 

23.9b 


.78 
b9. 78 
99.6b 


3 

17 

51 




F ratio 


* .191 (n*8*) 
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WSL S 15 — Continue 



Source 


Sums of Squares Sums of Cross- 
for Hound 2 Products 


Sums of Squares 
for Hound 1 


df 


Teaching Performance: 


Organizing the Class 






Treatments (?) 
Subjects (S) 

T x S (Error: E) 


1,02 

hZ.k9 

m.to» 


- .It9 
3*. .26 
19*80 


.• 6 $ 

68.06 

91,07 


3 

17 

51 




P ratio 


» ,196 (n. 8 .) 






6 0 Teaching Performance? 


Class Control'; 






Treatments (T) 
Subjects (S) 

T x S (Error: E) 


1.31 

l»9.2l* 

10 S.lslt 


. 1)6 

3l).35 

22.81) 


.33 

68.60 

81.27 


17 

51 




F ratio 


53 5 I 82 iSaSe) 






7. Teaching Performance: 


Classroom, Climate 




Treatments (T) 
Subjects (S) 

T x S (Error: E) 


1.67 

57.71) 

109.33 


.69 

29.01) 

30.03 


.70 
$ 7.99 
100.33 


3 

17 

51 




F ratio 


* o 219 (iic>So ) 






8 . Teaching Performance: 


Student Participation 






Treatments (?) 
Subjeots (S) 

T x S (Error: E) 


1.31 

39.07 


.31 

19.27 

17.72 


•1)9 
51.10 
92) .08 


3 

1? 

51 




F ratio 


» ,198 (n.a.) 







TABLE 15— Continued 




Source 


Sums of Squares 
for Round 2 


Sums of Cross- 
Products 


Sums of Squares 
for Round 1 


df 


9o Teaching Performance: Measuring Achievement 






Treatments (T) 


1.67 


1.19 


1.87 


3 


Subjects (S) 


25.77 


14.37 


46.12 


17' 


T x S (Error: E) 


94.90 


15.91 


82.66 


51 




P ratio B *228 (n»s.) 







10* Teaching Performance: Using Measurement 



Treatments (T) 


3.22 




.71 


1.16 


3 


Subjects (S) 


29.92 




l6 0 6U 


43.74 


17 


T x S (Error: E) 


86.2? 




8*6l 


86.57 


51 




P ratio ■ 


.602 (n, 


)S# ) 






11* Step One: 


Performances 


1 and 2 








Treatments (T) 


8.52 




6.52 


5.10 


3 


Subjects (S) 


133. U 




77.78 


196.93 


17 


T x S (Errors: E) 


355.02 




80.39 


335.95 


51 




F ratio = 


•286 (n< 


>s$) 







12* Step Tito: Performances 3* h and £ 



Treatments (T) 


13.05 


2.90 


4.51 


3 


Subjects (S) 


300.19 


184.52 


484.23 


17 


T x S (Error: E) 


883.72 


197.49 


848.68 


51 


• 


P ratio « 


.237 (n.s.) 
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v 



' i, * 
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51 



TA BLE l5~ Continued 



Source Sums of Squares Sums of Cross- 

for Round 2 Products 


Sums of Squares 
for Round 1 


df 


13c Step Three 


s Performances 6, 7 and 8 






Treatments (T) 
Subjects (S) 
TxS (Errors E) 


11.73 

1(13.8$ 

91U.66 


3.30 

214.81 

213.9$ 


2.29 

$09.$3 

758.12 


3 

17 

51 




F ratio * 


.196 (n.s.) 






lb. Step Four: 


Performances 9 and 10 






Treatments (T) 
Sublets (S) 
TxS (Error: E) 


9.11 

107.12 

350.03 


3.81 

6lo$k 

l»l 8.1*9 


5.92 

178.13 

331.5° 


3 

17 

51 




F ratio « 


•39b (n.s.) 






15. i\ll Performances (total) 


Treatments (T) 
Subjects (S) 
TxS (Error; Ej 


114,11 

3287.88 

901(2.$2 


76.91 

2095.80 

20l»7.$7 


1(2.92 

5023.95 

8152.93 


3 

17 

51 



F ratio ® *210 (n.s.) 
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TABXE IT 



ANALYSIS OF COVAIQ&MCB* SIGNIFICANCE OF 2EFFEMCES OF M2&N COORDS AKONG 
THE FOUR GROUPS, EACH OF 18 SUBJECTS, OBTAINED OH THE THIiJD 
HOUHn OF OBSERVATIONS (HOUND 1 OBSERVATION RATINGS 

USED AS COVARIABLE) 



1 



Sours® 


Sums of Squares Sums of Cross 
for Round 3 Products 


Sams of Squares 
for Round 1 


df 


1© Teaching Performance} Suitability of Goals 






Treatments (T) 
Subjects (S) 

Tx 8 (Errors E) 


9o37 1.3& 
S?,85 17.SU 
86.69 17.75 


3.78 

1*5.90 

81.51 


3 

1? 

5l 




F ratio * 1©80 (n*s«) 






2« Teaching Performances Student Acceptance of 


Gcals 




Treatments (T) 
Subjects (S) 

T x 8 (Error t E) 


7.53 .lo 

37.58 2.2U 

110,01* - 3,0? 


»3^ 
$6*h9 
9$*h 2 


3 

17 

$1 


- 


P ratio « laU.il (n,s,) 






3? Teaching ,PerfOKaanees Use/of .Reaoarceo 






Treatments (T) 
Subjects (S) 

T x S (Errors E) 


5.19 - .32 

37.00 - 1,10 

88,17 - .59 


l,5l 

1*9.1? 

109.52 


3 

17 

51 




P ratio » ,983 (n.s.) 






* 4 © Teaching Performances Selecting the Plan 


• 


- 


Treatments (T) 
Subjects (S) 

T x S (Error; E) 


U.73 - ,57 

32.18 10.ll* 

97.18 17.27 

F ratio ■> .878 (a. s.) 


.78' 

1*9.78 

99.61* 


3 

17 

51 



ERIC 



pr i^e^qam w igg^ppg^^ " ■ 7 ^ rz \ * ?*s &mmzP *y*j* t i qW W l 

IC^^' ‘ , ,^y; A C; v ';' v^V' •••'?;. 
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TABUS 17— Continued 



&VU7C0 


Sums of Squares Sums of Grose- Sums of Squares 
for Bound 3 Products for Round X 


df 


>* Teaching Ikrf drmance i 


Organizing the Claes 






Treatments (T) 
Subjects (S) 

T x S (Error : E) 

~~n V* 


7.8li 

38,29 

13&& 


- 1.87 
10.99 
Ut.67 


*65 

680 O 6 

%.Q7 


3 

17 

51 




F ratio 


■ 1*233: (»•«•) 






6 * Teaching Performance; 


Class Control 






Treatments (T) 
Subsists (S) 

T x S (Error: E) 


5.77 

39.16 

111.60 


- .68 
12 . 6 U 
18.97 


.33 

68.60 

81.27 


3 

17 

51 




l : ratio 


• o9U9 (n. 8 .) 






7« Teaching Performance? 


Classroom Clinate 






Treatments (T) 
Subjects is) 

T x S (Error: E) 


7.71 

l( 0 ,w> 

1 QM& 


~ 1.71 
15.1(9 
25.60 


.70 

57.99 

100.33 


3 

17 

51 




F ratio 


■ 1.1(59 (n.s.) 






- • • - yw „ . *_ , ' 

8 -, Teaching Performance: 


Student Participation 






^ * * '**;*+*+- k ^ „ *_»v , / 

Teeatmepts (T) 
Subjects (5) 

T x S (%ror : E) 


6.77 

37.57 


- .25 

20.09 
22.53 


51*10 

&.08 


3 

1 ? 

& 


}\'V V -> »s - ■. v.'-Vs WUA . . ■ 


F ratio 


« 1.173 (n.e.) 

W* **» ^ * 1 
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F 
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f 





— a — - - - • .... >. -... a- . ■ - . a a VM-h ■ a **&, - — — ■■ ^ ^ 
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TA2LE 17-«Co»tinuod 



Source 



Sums of Stereo 
for Hound 3 



Sums of Cross- 
Products 



Sums of Squares 
for Hound 1 



df 



9* Teaching Perfcrraanca* Measuring Achievement 




2&t»'A 



r' CTWm g: y 

T ..’ '' t -S'/,- 4 

t _ " . <&s V ' 

' „*V*'r* - Vn , '--A 



Treatments (T) 


6o3? 


2.36 


1.8? 


3 


Subjects (S; 


26.77 


15.61 


Ufa* 12 


17 


T x 3 ( Effort t E) 


87.88 


35.92 


82.66 


51 




P ratio • 


1.076 (n.8.) 






3 8$ Teaching Performances 


Using Measurement 






Treatments (T) 


10.59 


2.77 


1,16 


3 


Subjects (S) 


1*2.31 


22.23 


1*3.71* 


17 


T x S (Errors E) 


92.5U 


lii.67 


86.57 


51 




P ratio ■ 


1.778 (n.s») 






11. Step Ones 


Performances 1 and 2 






Treatments (T) 


32.99 


.72 


5.10 


3 


Subjects (S) 


129.31 


55.29 


196.93 


1? 


T x S (Errors E) 


371.53 


83.16 


335*95 


51 




P ratio • 


1.561* (tus.) 






12. Step Twos 


% 

'H 

Performances 3$ Is and 5 


1 * + 1 ] 3 11 




Treatments (T) 


1*9.13 


- 10.65 


u.Si 


3 


Subjects (S) 


303.12 


100.26 


USle.23 


17 


T x S (Errors E) 


Situ 23 


11*7.53 


81*8.68 


51 




F ratio ■ 


1.222 (n.s.) 







I 



■ -m. 

L ->X 












4 l, 

\ ^ 



I 



lv 



ERIC/, 
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i'ABLE 17- -Continued 



V- f\ ' 



: oa^'Jv ' A : 

'’ k JV..*V w, - 



Q/ ,\ v- • * <v „ - ••- . 

, ;=\" -V'vvcr .;. ■ 



*"h 



,£T 



Source 


Sums of Squares Sums of Cross- 


Sums of Squares 


df 




for Round 3 


Products 


for Round 1 




13# Step Three: Performances 6, 7 and 8 






Treatments (T) 


57,71 


- 9.05 


2,29 


3 


Subjects (S) 


336.06 


139.35 


509.53 


1? 


T x S (Errors E) 


882,22 


190.38 


758.12 


51 




F ratio * 


1.306 (n»3.) 






lii» Step Four: Performances 9 and 10 


Treatments (?) 


33.38 


10*2? 


5.92 


3 


Subjects (S) 


130.86 


75.89 


178.13 


17 


T x S (Errors E) 


339.15 


62*06 


331.50 


51 




F ratio = 


lc 501 (n«s 0 ) 






15. All Performances (total) 


Treatments (T) 


671.79 


- 18.37 


62.92 


3 


Subjects (S) 


3528,96 


1663,23 


5023.95 


17 


T x 3 (Errors E) 


8653.71 


1875.66 


8152.93 


51 




S’ ratio ■ 


1,618 (n,,3.) 







. •;/;-;> -V 



I 



B 
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/ •"•*: ; ( ' 
✓ ' f . , ■* 
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Apparently the total variance among the four groups was not sufficiently 
greater fcfaan the variance within each of the four groups. Contributing 
to this vithin-group variance was the relatively low amount .>of agreement 
among observers which has been discussed. Also the means for the total of 
all behaviors for each group ranged from 2 . 3U to 2.96 on a 7-point scale. 

The ratings were thus depressed within the 7-point scale to a relatively 
small range, limiting the potential amount of variance for this analysis* 

The fourth round of observations took place in. January, the last 
month of the fall term. The means for each of the four groups on the 
ten behaviors compared to the respective means of these groups on the 
first round of observations is reported in Table l8. Ths three experimental 
groups (I, II, and III) on this series of observations were rated as 
superior to Group IV in 28 of 30 comparisons. The degree of superiority on 
the total teaching performance of Group I over Group IV was 31 per cent, 
for Group H over Group IV was 26 per cent, and for Group III over Group IV 
was 58 per cent. These percentages are based upon the net gains of groups 
compared to the Group IV mean rating. In Ili.SSS j?3l tings the lowest possible 
score Weis loO To calculate the tine relationship of the net gains of the 
groups to the mean rating of Group IV on ail ten behaviors the actual mean 
rating of Group IV, 2.55, was converted to 1.55* The graph in Figure 1 
shows the actual relationship of these scores. 

Group III was superior to all groups and considered superior to Groups 
I and II by about the 3ame degree these two groups were considered the 
superior of Group IV. The changes in rated jjerformances on the part of 
Group III between round 3 and round k were the greatest degrees of ohange 
noted by observers in the entire study. 

In terms of net change from the beginning of the study to the end 
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Figure S 

Mean Ratings on All Ten Criteria 
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of tha study, six of the ten behaviors* teaching performances 3 through 8, 
show the most marked differences between experimental groups and Group IV* 
These are the behaviors which observers judged most likely to be seen as 

V?AA^> AM - V 1 I I I « • «• . h a. _ . 

w ~ v s v* uj i.ouavAujr vy line readier uurmg r,ne perioa or observations. 
On these six behaviors the superiority of the experimental groups to 
Group I? ranged from 28 per cent to 78 per cent of the mean Group IV ratings. 
The analysis of covariance is given in Table 19* This analysis shows 

4 

no significant difference among the four groups. As in the analysis on 
the third round* in this final round the total variance among groups was not 
sufficiently larger than the variance within groups to be considered 
significant. 

Summary of the Data on Classroom Performance 

There are no differences in total variance among the four treatment 
groups which were significant. There were consistent patterns of difference 
among the groups in certain directions* These patterns are illustrated 
by the graph in Figure 1 which summarizes these tendencies by showing 
the means of the four groups calculated from the total judgements of 
observers on all ten behaviors for each of the four rounds. These consistent 
patterns cf change 3 how: 

1. Groups 1 and II wore rated consistently superior to Groups III and 
IV at the third round of observations. 

2. All three experimental groups were superior to Group IV on the 
final round. 

3. Group: III was superior to all of the other groups on all behaviors 
in the final round. 

Variations of ratings within, treatment groups, the small number of 
subjects in the final matched group analysis* and the limited range of 
performance scores all contributed to the finding that variability among the 
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ANALYSIS OF COVARIANCE: SICMHCANCB OP HEPPERENGES OP MM SCO® MOHG 
THE POUR GROUPS, EACH OP 18 SUBJECTS, OBTAINED ON THE FOURTH 
ROUND OF OBSERVATIONS (ROUND 1 OBSERVATION RATINGS 

USED AS CCVARIAELE). 



Source 



Stuns of Squares 
for Round h 



Si ns of Cross- 
Products 



Sums of .Squares 
for Round 1 



1* Teaching Performance: Suitability of Goals 



df 



Treatments (T) 


8.10 


2io2ii 


3.72 


3 


Subjects (S) 


31*22 


12,30 


1*5.90 


IT 


T x S (Error: E) 


1X0.60 


Hull* 


81.51 


51 




F ratio 


53 1*01 (xus*) 






2* Teaching 


Performance: 


Student Acceptance of Goals 




Treatments (T) 


5.26 


ol2 


.38 


3 


Subjects (S) 


50.0? 


loll 


56.1*9 


17 


T x S (Error? E) 


125.78 


“ * 2 3 


95.1*2 


51 




F ratio 


« *69k (n*s«) 






3. -Teaching 


Performances 


Use of Resources 






Treatments (T) 


11.97 


.78 


1.51 




Subjects (S) 


51.09 


- 3,86 


1*9.19 


17 


T x S (Error: E) 


130.32 


- .59 


109.52 


51 




F ratio 


* 1.529 (jus.) 






h» Teaching 


Performance: 


Selecting the Plan 






Treatments (T) 


li».77 


2.12 


.78 


3 


Subjects (S) 


30.88 


18,61 


1*9.78 


17 


T x S (Error: E) 


13l*.l*8 


18,28 


99.61* 


51 




F ratio 


- 1.779 (n.s.) 
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TABLE 


I 19— Continued 


'vjr 




Source 


Stuns of Squares Sums of Cross- 


Sums of Squares 


df 




for Round k Products 


for Round 1 




5 « Teaching performance : 


Organising the Class 






Treatments (T) 


9.10 


- 1.20 




3 


Subjects (S) 


* 3203 


28.95 


68*06 


17- 


T x S (Errors E) 


11(6.06 


6.11 


91,0? 


51 




F ratio 


« le065 (n«Se) 






6* Teaching Performances 


Class Control 






Treatments (T) 


11.72 


.80 


.33 


3 


Subjects (S) 


40.79 


32.21 


68.60 


17 


T x S (Erro r: E) 


llil(.89 


19.87 


81.27 


SL 




P ratio 


» 1o 350 (n.s.) 






7* Teaching Performances 


Classroom Climate 






Treatments (T) 


n.ik 


1.08 


.70 


3 


Subjects (S) 


36.06 


2U.92 


57.99 


17 


T x 3 (Errors E) 


135.1a 


23.38 


100.33 


51 




F ratio 


“ 1.369 (n.e.) 






6 0 Teaching Performances 


Student Participation 






Treatments (T) 


17.35 


2.09 


.1 (9 


3 


Subjects (S) 


3l(.75 


. 19.96 


51.10 


17 


T x S (Errors E) 


127.29 


26.55 


9k. 08 


51 




P ratio 


• 2.333 (n.s.) 
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TABLE 19— Continued 



Source 


Sums of Squares Sums of Cross- 
for Hound h Products 


Sums of Squares 
for Hound 1 


df 


?• Teaching Performance: 


Measuring Achievement 






Treatments (T) 


9c00 


3.32 


1.87 


3 


Subjects (S) 


35.51* 


17.51* 


1*6.12 


1? 


T x S (Error: E) 


108.30 


19.1*2 


82.66 


51 




F ratio 


■ 1.198 (n.s.) 






10, Teaching Performance: 


Using Measurement 






Treatments (T) 


I5.it8 


2.05 


1.16 


3 


Subjects (S) 


52.23 


13.88 


1*3.71* 


1? 


T x S (Error: E) 


106.09 


3.2.32 


86.57 


51 




F ratio 


■ 2.373 (n.s.) 






11* Step One: 


Performances 1 and 2 






Treatments (T) 


25.78 


9.89 


5.10 


3 


Subjects (S) 


11*8.1*5 


71.95 


196.93 


17 


T x S (Error: E) 


1*1*5.01 


71.59 


335.98 


51 




F ratio 


*• .836 (n.s.) 







12* Step Two: Performances 3> & and 5 



Treatments (T) 


102.82 


6.60 




l».5l 


3 


Subjects (S) 


292.25 


19l».19 




1*81*.23 


17 


T x S (Error: E) 


1168.08 


102.36 




81:8,68 


$1 




F ratio 


• 1.1*60 (n*s.) 
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TABUS 19— Continued 



Source 



Sums of Squares Sums of Cross- Sums of Squares df 
for Round 1* Products for Round 1 



13# Step Three : Performances 6, ? and 8 



Treatments (T) 


1X7,57 


10.15 


2.29 


3 


Subjects (S) 


315.W 


225.li? 


509.53 


17 


IxS (Error: E) 


1162.96 


198.1(3 


758.12 


$1 


* 


F ratio 


■ 1*685 (n*s*) 






lit* Step Four: 


Performances 9 and 10 






Treatments (T) 


1*6,83 


11,07 


5,92 


3 


Subjects (S) 


166.58 


63.10 


178.13 


17 


t'xS (Error: E) 


101.55 


62.33 


331.50 


£L 




F ratio 


- 1.77U (n.s.) 






15* All Performances (total) 


Treatments (T) 


1050.55 


179.78 


1*2.92 


3 


Subjects (S) 


3100.69 


2051.80 


5023.95 


17 


T x S (too r: E) 


1631.73 


1662,01* 


8152.93 


51 




F ratio 


“ 1,14*0 (n.8.) 

* . 
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groups was not significant. The consistent pattern of differences, however, 
does not appear to the. .staff to be a finding which canbe attributed . I 
to chance, and i 3 thus considered to be a significant result of this analysis, 



Analysis of the Data on Teacher Attitudes 



The revised attitude scale was administered to all the beginning teachers 
in the study during each round of observations. The scale consisted of 
seven concepts related to the profession of teaching which appeared to 
discriminate among subjects during the pilot study. These concepts were 
tests, teachers* meetings, school administrators, homework, teaching work 
load, grading and teaching as a career. Adjective scales were taken from 
Osgood* s thesaurus analysis. These scales were tested during the pilot 
study, and those scales which appeared to be most promising as discrimin- 
ators were utilized. Particular adjective scales varied somewhat from concept 
to concept, depending upon the evidence obtained in the pilot study. The 
revised scale appears in Appendix B. 

The first administration of the attitude scale, during the first round 
of observations, took place between the second and fourth weeks of school. 

Each subject was mailed a copy of the attitude test before the observation 
team*s first visit. At the conclusion of the observation the team picked up 
the completed attitude test from the subject. The same procedure was 
followed in the laiar administration of the test. 

ifean scores and standard deviations for each treatment group of begin- 
ning teachers for each adjective pair on each concept are reported in 
Table 28 in Appendix B. The same table shews these data for all four 
administrations of this semantic differential attitude scale. low scores 



ERIC 



■jr 






r»Vv v ;T*r- 






— Z - 

. V' j- a 






- 

< * , 






V ‘ * 



i n n ^ wr 

* „ v ^ 



mj m w p mp 1 WJy 'qww 















! 








indicate relatively high or favorable al titudes, and high scores indicate 
low or unfavorable attitudes. 

The initial assessment of attitudes indicted a generally favorable 
feeling towards the profession of teaching* On the second administration 
of the attitude test, four and a half weeks after the first administration, 
most of the mean scores of the groups were in the negative direction. 

(Table 20 ) There was not much net change in attitude by the entire popula- 
tion of the study after the second administration* A possible exception 
was the changes in attitudes reflected towards the concept, "Teaching 
Work Load". The beginning teachers showed a strong negative change towards 
this concept on the second round of appraisals, followed by a strong positive 
change on the third round of administrations* 

Table 21 reports the net changes in attitude from the first to the last 
administration of the test by treatment groups . Only members of Group II 
changed their responses to the attitude scale in a positive direction* 

Group II attitude scores were slightly lower than other groups at the 
beg inn i n g of the study, which may account partly for this difference in 
pattern of attitude changes. 

Table 22 shows the analysis of variance of group mean scores on the 
attitude scale when data for all four rounds are grouped. Significant 
differences between rounds, or administrations, were found for two of the 
seven concepts. There were no significant differences among treatment groups. 
The significant F ratios between rounds, together with results of simple 
inspection of the direction of attitude changes, suggest that there was 
a significant tendency for all the beginning teachers to show more negative 
attitudes toward teaching after the first month of experience. 

The F ratios obtained for each round in the analysis of variance among 
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T4BZ£ £0 

DIESCTIOH OF ATTITUIE CHANGES K IREAB MT 0 R0PPS SHOWS OK FOOB 
All tUnSTRATXONS OF THE SEMANTIC JEBKRENTUI. SCAI® 



i 

l 



ERIC 





From Round I to 

4 


From Round 2 to 3^ 


From Round 3 to ii c 


Concept 


No* Pos- 
itive 


No* Neg- 
ative 


No Pos- 
itive 


No Neg- 
ative 


No, Pos- 
itive 


No© Neg- 
ative 


1© Tests 


6 


2k 


13 


17 


■ 

16 




2. Teachers* 
Heatings 


* 


/*Q 


15 


17 


17 

i 

j 


15 

i 


3© School Admin- 
istrators 


9 


19 


7 


a 


1 

lU 


13 


l*.. Homework 


8 


22 


18 


3i* 


18 


13 


5© Teaching Work 
Load 


3 


23 


20 


8 


17 


10 


6© Grading 


11 


111 


18 


9 

y 


ii 


16 


7© Teaching as a; 
Career 


>> 

o 


18 


8 


j 16 
* 


12 


11 

me* 



a No changes 9 comparison/, j ^No changes 3 comparisons c No changes U comparisons 

T4BIE 21 

HEP CKflH&ES IN ATTITUDES FROM THE BEGINNING TO THE END 
OP THE STUDY BY THE FOUR TREATMENT (SOUPS 



Group j 


| Negative 


! Positive 


i No Change 




Group I j 


? — — 

39 


! 

! 11 , 


1 1 
» 




Group H 


1 2U 

i 


! 2? 

I ! 


- 




Group HI 


! U3 


: 7 


1 




Group IV 


39 


j 9 j 


3 
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TAB IE 22 

ANALYSIS OF Vomits AMONG SUBJECTS TESTED EITTHIT FOIU TREATS®:? GROUPS 
ON FOUR ROUNDS OF THE SEMITIC DIFFERENTIAL ATTITUDE TEST 



Source of Variance Sums of Squares 


Mean Squares 


df 


F ratios 


1. Concepts Tests 


Treatments (T) 


29U.80 


9 8.27 


3 


1.10? 


Rounds (R) 


76.80 


25.60 


3 


1.99“ 


Subjects tested with- 










in Treatments (SsT) 


8,208.90 


89.23 


92 




Treatments x Rounds (TR) 


117.20 


13.02 


9 


1.01° 


Rounds x Subjects (R SsT) 


3,558.20 


12.89 


276 


* 


2. Concepts Teacher 5 s Meetings 

* 








Treatments (T) 


520*20 


173.1)0 


3 


.99% 


Rounds (R) 


188.70 


62.90 


3 


3.60?# 


Subjects tested with?* 










in Treatments (SsT) 


l6,70iu#> 


181.58 


92 




Treatments x Rounds (TR) 


112.70 


12.52 


9 


•72° 


Rounds x Subjects (R SsT) 


hf 820.80 


17.U? 


276 





3* Concepts School Administrators 



a P„ 



NS, 



1S S)T 

b *R " 

!S 83sT 
0 Pj « HSjr 

,b K»ST 

4 Significant at .0$ level 



' * ■ ; w. c r 



Treatments (T) 


339.90 


113.30 


3 


1.1)7®. . 


Rounds (R) 


130.90 


1)3.63 


3 


3.93“* 


Subjects tested with- 


7,099.60 








in Treatments (SsT) 


77.17 


92 


■ ■ ■ 


Treatments x Rounds (TR) 


93.90 


10.1i3 


9 


»91) c 


Rounds x Subjects (R SsT) 


3,063.50 


11.10 


276 
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MWE 22-Coati.nued 



*4 



r 

wvu* WO VA v 


Sumo 


Ox SquoLreo MuSu Squeu^Oo ux 


O -S- ■& AM 

x xau&vo 


Concepts 


Homework 







Treatments (T) 


232.90 


77.63 


3 


•l*2 a 


Rounds (R) 


73.20 


2h'k& 


3 


.32 b 


Subjects tested with- 










in Treatments (SsT) 


17,095.80 


185.82 


92 


•l»2 e 


Treatments x Rounds (TR) 


69.20 


7.69 


9 


Rounds x Subjects (R S:T) 


5,099.93 


33.1t8 


276 





$• Concepts Teaching Work Load 



Treatments (T) 


<67 -fin 


ifto.on 

— * tr 


*5 


1.3C£ 


Rounds (R) 


70.60 


23.53 


3 


1.81“ 


Subjects tested with- 
in Treatments (S:T) 


13,366.60 


11*5.29 


92 


.75* 


Treatments x Rounds (TR) 


87.60 


9.73 


9 


Rounds x Subjects (R SsT) 


3,597.20 


13.03 


276 





6* Concepts Grading 



Treatments (T) 


607.00 


202.33 


3 


loop 


Rounds (R) 


63.10 


21.03 


3 


.96S»- 


Subjects tested with- 
in Treatments (SsT) 


16,877.20 


183.1*5 


92 


1.30® 


Treatments x Rounds (TR) 


256.90 


28.51* 


9 


Rounds x Subjects (R SsT) 


6,051*. 20 


21.91* 


276 





7* Concepts Teanhiag as a Career 

im^rnmnin r mttt— — — — r m ' m t— r— ~i 



Treatments (T) 


132.90 


idi.30 


3 


•97® 

2.26® 


Rounds (R) 


3l*.50 


n .50 


3 


Subjects tested with- 
in Treatments (SsT) 


1*,1S1.00 


3*5.1*5 


92 


1.39* 


Treatments x Rounds (TR) 


63.90 


7.10 


9 


Rounds x Subjects (S SsT) 


l.ltOJ.OO 


5.09 


276 





the four treatment groups, is shown in Table 23, This table seems to show 
that differences among the four groups as to attitude were slightly more 
pronounced, not significantly, at the beginning of the study and less so 

A 4r » 4f* J'. 

v*JO WW WA VUO 

Inspection of Table 26, Appendix B, shows that there was littlo net 
» 

change in attitude during tile period of the study. Only 1$ of 201; group 
mean scores on different adjective peirs of the smsn concepts tested 
changed by as much as half of a point on the seven-point scale used in the 
semantic differential attitude test) 

The data obtained fares* the semantic differential attitude scale do not 
appear to show significant differences which are related to the experimental 
treatments of the tub* tets in this study* 



Characteristics of Teachers Related to Performance 
Two characteristics of the subjects in the study were correlated with 
teaching, performance® These characteristics were cumulative grade point 
average earned in the bachelor degree program and the grade point average 
of courses in Education. These two averages were correlated with the 
mean ratings of observer teams on each of the ten teaching behaviors 
evaluated in the study. Table 2k reports these correlations for the final 
round of observations. Both sets of correlations show slightly negative 
relationships, although for practical purposes these correlations approach 
sere. There appears to be no correlation of grade point earned in college 
with performance as rated by trained observer teams. Other data regarding 
characteristics of beginning teachers which might have been related to 
performance in this study resulted in such small sub-groups, /hat no 
practical purpose seemed likely to be served by further analysis. 
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\ 

Summary: The Evidence as to the Major Hypotheses 

Hypothesis 1 was that there would be no differences in selected aspects 
of cla33rocm behavior between groins of elementary teachers who have a 
reduction in load ' COSpai^vi with oihwra mth no rwitction* AH three of . the "» 

experimental groups were given a reduction in lead. Compared to Group IV 
teachers. Group HI teachers had 2 $ per cent fewer pupils and Groups I and 
U teachers met their pupils for 2 $ per cent less time during each day. 

The analysis of covariance supports the hypothesis in that no signi- 
ficant differences among groups were obtained* The observed patterns of 
differences would reject the hypothesis in that 28 out of 30 comparisons of 
Groups I, II and III teachers with Group IV on the final round of appraisals 
of classroom performance showed that each of the experimental groups were 
superior to the control group. 

The research staff believed that the consistent findings showing superior 
classroom performances by teachers with reduced loads, together with the 
relatively large percentage differences between mean scores, can not be 
explained as being likely to result from pore chance. The staff judged 
that the weight of evidence would reject the hypothesis, but that others may 
judge differently fma these same data. 

Hypothesis 2 was that there will be no significant differences in 
attitude between groups of elementary teachers with rocmc@d loads and those 
with no reduced loads. Again the comparison to te3t the hypothesis was 
between Groups I, II and III with Group IV* The analysis of variance 
produced no significant differences, neither were there observed patterns 
of differences. Regardless of load, beginning teachers tend to change 
their attitudes towards teaching in a negative direction during the first 
five months of school. Hypothesis 2 was not rejected by the evidence. 
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TAB IE) 23 



P RATIOS PROM ANALYSES OF VARIANCE AMONG TREATMENT GROUPS ON EACH OP 
POUR ADMINISTRATIONS OP THE SEMANTIC DEFPERENTIAL ATTITUDE TEST 
(N w ?2 P of 2# 72 required for .05 level of significance) 



Concept 




P Ratios 




Round 1 1 


Round 2 


Round 3 


Roun d k 


1» Tests 


.56 


.5fc 


2.23 


1.06 


2* Teachers 1 
Meetings 


1.77 


.91 


.62 


.U7 


3* Administrators 


2.71 


1.38 


1.07 


.51 


iw Homework 


.17 


.73 


CM 

• 


•1(8 


5. Teaching Work 
Load 


2.29 


.69 


.79 


1.20 


6. Grading 


2.12 


.65 


.86 


.99 


?• Teaching as a 
Career 


2.51 


| 1.16 


.92 


.37 
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TABLE 2i* 

CORRELATION COEFFICIENTS OF AU-COLLEGE GRAH3 POINT AVERAGES AND 
EDUCATION COURSE GPA WITH FINAL RATINGS OF TEACHING PERFORMANCES 



Teaching Performance 


t with Collage 
GPA 


f with Education 
Course GPA 


!• Suitability of Goals 


I 

• 

M 

ro 


-.09 


2* Student Acceptance 
of Goals 


-.13 


-.07 


3* Use of Resources 


-.11 


er\ 

© 

. 

1 


li. Selecting the Platf 


-.11 


CO 

o 

* 

t 


5. Organizing the Class 


-.1$ 


-.07 


6» Classroom Control 


-.16 


-.0? 


7. Classroom Climate 


-.15 


-St 

o 

. 

1 


8* Student Participation 


-.12 


CA 

O 

. 

1 


9* Measuring Achievement 


-.16 


-.10 


LD* Using Measurement 


-.13 


-.08 


Total, 1* — 10* 
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Hypothesis 3 was that there will be no significant differences in 
terms of selected aspects of classroom behavior between groups of beginning 
elementary teachers who received different types of inservice instruction 
during released time periods. To test this hypothesis. Groups I and H were 
compared. Neither the analysis of covariance or the inspection of patterns 
of difference?; produced evidence which would reject this hypothesis. 

Hypothesis li was that there will be no differences in attitudes 
between groups of teachers who received different types of inservice instruc- 
tion during released time periods- Analysis of variance shewed no diff- 
erences in the semantic differential attitude scale between Groups I and II* 
Group II differed somewhat from Group I (and also from Groups HI and IV) 
in that teachers of Group II showed more evidence of change in attitude in 
a positive direction during the period of the study* The net changes in 
attitude, positive or negative, did not appear to be large, and the research 
staff judged that the evidence of the study supported, rather than rejected, 
hypothesis 2*. 



2a. 



CHAPTER VI. 

SUMMARY, CONCLUSIONS AND SUGGESTIONS F OR FURTHER RESEARCH 

This study was an attempt to see whether experimental treatments involving 
reduced loads and intensive inservice instruction would affect the performances 
and attitudes of beginning teachers. The treatments were chosen because 
an advisory committee to the research project believed in the efficacy of 
these treatments on the bases of their respective backgrounds of experience. 

The 25 per cent reduction in load and the inservice instruction which was 
given twice a week over a period of one semester as treatments to experimental 
groups of teachers consituted rather gross variables. It was hoped that 
these treatments would be sufficiently powerful to be reflected in changes 
in behavior, but the treatments were deliberately kept to a moderate degree 
of intensity in hopes that the project staff could make judgements as to the 
feasibility of administering these treatments. Questions as to cost of the 
treatments and the degree to which the education of teachers would be 
prolonged were factors considered subjectively in setting the intensity of the 
experimental treatments. 

The project staff hoped to conclude from the data of the study to what 
extent an internship program for career teachers would be justified. The 
practical purpose for the study, then, was to determine the degree to which 
internship programs, based upon the experimental treatments in the project, 
would bring about changes in behavior which were proportionate to the cost 
and effort expended. The staff also hoped to be able to identity questions 
which would be important to study with more refined research designs. 

The measures of performance and attitude used were chosen because they 

seemed likely to reveal changes in behavior and would yield reliable data for 

comparing subjects. The purpose of the instruments used was to identity 

differences among subjects, rather than to help them improve their teaching. 

7$ 
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The observation check list is not claimed to be a valid way of assessing 

t 

the entire teaching act, but was chosen because it seemed a promising tool 
for appraising a sanple of a teacher's classroom skills. The semantic 
differential attitude scale was selected because it was easily administered 
and other studies suggested it could be sensitive to differences among 
subjects on concepts which seem to reflect attitudes. Finally, the evaluation 
instruments were chosen partly because different members of the advisory 
committee had experience in other studies with these instruments. 

The experience in this study with the appraisal techniques used to 
evaluate teachers' performances suggests that experienced teachers can be 
trained to use an observation check list and arrive at a fair degree of 
agreement with other such trained observers*. Judging •from the' ratings, of 
groups of teachers by teams of observers. Idle trained observer tends to see 
different behaviors defined in the ch«ck-lLst as different ways of judging 
one, general teaching performance. Also, two of the observers in the study 
rated teachers somewhat differently than the other seven. A limitation to 
this appraisal procedure is the possibility that some observers will differ 
from the majority depending upon personal characteristics which are not 
known to the staff of this project. 

A possible subject for further study would be t&e client to which 
diffsre^t kinds and amounts of professional experience affect observers' 
appraisals of teaching performance. 

The observers in this project were trained in seminar sessions with 
Or* Harry Garrison, who helped develop the Stanford Teacher Competence 
Appraisal Guide and who adapted the Stanford quide in the form used for this 

project. The observers then practiced using the checklist in observations of 

*> 

experienced, able Seattle teachers and in observations of student teachers 
from colleges in the Seattle area. In this way they were able to see a range 
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of teaching behavior* They also practiced by using the checklist to evaluate 
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teaching behavior which had been video 'taped . In this way they were able 
to test their judgements by comparing then with other observers* 

At the start of the project, the observers rated the beginning teachers 
somewhat below what they conceived to be average for teachers in general* 

After about a month, the observers rated these teachers a second time and 
found their performances te be lower than on the first round* These lower 
ratings were reported for all teachers regardless of treatment* The group 
of beginning teachers who received no special treatment remained at this low 
level of rated performance throughout the period of the study* Two of the 
three groups rec&lving treatments showed marked improvement on the third 
round of appraisals. The experimental group which showed no improvement on the 
third round was rated the highest of all on the fourth and final round* At 
the conclusion of the study all three treatment groups were considered 
superior to the no-treatment group on 28 of 30 congiarisons* The differences 
between mear.3 of each of the three experimental groups and the control croup 
on total teaching performance ware between 26 and !;(S per cent of the control 
grgg&’s. mm score* 

In spite of the large and consistent apparent differences between 
groups, analysis of covariance showed that these differences were not signif- 
icant! y greater than the variance within groups* One reason for this was the 
shrinkage in the si^e of the sample. Subjects were assigned to treatments 
randomly after being matched in sets of four subjects* Vlhen one subject was 
lost because of illness or some other reason, the project lost that subject's 
counterparts in each -.of the ether groups* By the end of the study for 
analysis purposes there were 18 subjecta in each group instead of the 26 in 
each group who actually were appraised in the first round of observations* 
Another reason for the lack of significance in the analysis of covariance 
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was the variation in observer ratings within the different tease. Some 
agreement was obtained but the correlations between observers were lower than 
hoped. Another reason for lack of significance in the analysis of covariance 
was that two of the excerimental srouns tended to achieve mean ratings 

«k <* — 

which were almost the same* Thus variability among all four groups was limited 
by the fact that experimental groups were not different from each other, 
even though there were large apparent differences between these groups and 
the control group. 

Although; all three of ths experimental groups were rated superior to 
the control group by the end of the study, there were apparent differences 
among the three <ssparimentsl groups. Two of the experimental groups received 
intensive inservice help in addition to reduced leads. The third group 
received only reduced loads in the form of fewer pupils. At the third round 
the ©roups with inserviee treatments appeared to be markedly superior to both 
the control group and the other experiment?! group. At the final round, 
however, the group with only reduced loads as treatments was about the same 
amount superior to the two inservice group* as these two treatment groups 
were superior to the control group. Why Group I XT, the small- class group, 
made a sudden improvement in teaching performance is not clear. 

Bias on the part of the observer teams was considered as a possible 
explanation for the high final Group XU scores. Although the observers were 
not told to which group any of the beginning teachers were assigned, they 
could have identified Group HI subjects by noting the relatively small 
numbers of pupils in these teachers* classes. If this knowledge could have 
b esn reflected in biased ratings, it is still difficult to reconcile the fact 
that observer teams consistently rated Group IH teachers low for three 
rounds and then high on the last round. There seems no reason for a tillage 
in bias* if such existed. 
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The pattern of improvement in teaching performance on the part of the 
treatment groups receiving both inservice help and reduced loads suggests 
the major change resulting from this treatment occured in the first four 
months and little change occurred after that* (Figure 1 ) This pattern 
suggests an intensive internship type of treatment for beginning teachers 
during the f all of the first year of experience* The Qrcup III pattern, 
however, is inconclusive as to the amount of time necessary for intensive 
treatment 9 From the evidence in this study, we have no way: of predicting the 
level of teaching behavior on the part of Group HI after January. A longer 
study would be necessary to determine if the apparent high levels of perform- 
ances of Group I II teachers would be sustained* $hy having small classes 
was not effective in raising levels of teaching performance for four months, 

K* suddenly became effective is not clear* There seems no explanation, 
however, . other than the change in behavior by Group XIX teachers was related 
to small classes* Inspection of the patterns of changed teaching behavior 
on the part of the experimental groups suggests that an internship program 
should be extended for at least a semester and probably for the first year • 

The amounts of intensive treatment in the fora of inservice instruction, 
however, might be reduced in the second semester* 

The attitude changes of the teachers in this study did not appear to be 
of very great magnitude* There were significant changes in a negative 
direction after, the first month of school* All tlie treatment groups indicated 
a more negative attitude towards school-related concepts on the second 
I adminis tration of the attitude scale than on the first administration* 

Thi s lower level of attitudes was maintained throughout the re m ai n der of the 
study* Changes in attitudes, however, did not appear to be related to 
treatments* 
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Success of the teacher in courses while at college does not appear to be 
related to teaching performance* This seams to be true both of the over-all 
academic performance of the teacher while at college and those courses 
which were part of the professional education requirement* Correlations 
of grade point and teaching performance as rated by the Cwowrvor tfwoo WSTO 
consistently negative, but not to a significant level* 

In the judgement of the: staff the change in. classropm performance of tbs 
experimental groups and the lack of change on the part of the control group 
justify the development of a plan for internship as training for career 
teachers in the elementary school* In spite of the gross character of the 
experimental variables and the difficulty of controlling all factors which 
might have affected the results of the study, the general findings about 
teachers 1 .behavior were such that the staff fa^ls that it is highly unlikely 
that variables other than the treatments administered to the experimental 
groups could have accounted for the observed differences* 

Questions for Further 8tudy 

1* Will the advantage in teaching skill apparently demons treated by 
teachers in the experimental groups of this study be observeable at the end of 
one academic years at some point during the second year of teaching? 

2 c W311 a combination of small class size (Group HI treatment) and 
intensive inservice instruction of a type comparable to that administered 
to Groups I and H contribute to a level of teaching performance higher 
than that achieved by teachers given either of the three treatments in 
this project? 

3* WULl the general conclusions of this study as to the effects of 
reduced load and inservice instruction apply to beginning secondary teacaors 



as well as elementary teachers* 

2u Would recommended treatments other than those administered to the 
subjects of this study contribute to similar or different levels of teaching 

Borfft rB3K» a? 

5* Will beginning teachers given an internship experience similar to the 
c^psriaental treatments in this study remain in teaching longer thftft the 
average beginning teacher, or otherwise demonstrate a high level of committ- 
ment to the profession of teaching? 

6 e What special competencies and experiences are required for the person 
who provides intensive inservice instruction* to the bfgptaing teacher? 

7* If future teachers while in college are given systematic instruction 
and practice in demonstrating the specific behaviors evaluated in this study, 
will they then demonstrate these behaviors at a significantly higher level 
than comparable beginning teachers? 
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RYANS* CLASSROOM OBSERVATION RECORD 

Class or 
^ e x — .S ubi ect 



School 



Time 




PtfPIL BEHAVIOR 

1* Apathetic 1234567N 
2. Obstructive 1 2 3 4 5 6 7 N 



3* Uncertain 
4, Dependent 



1234567N 



1234567N 



REMARKS 



Alert 

ResDonsiMe 

* 

Confident^ 
Initiating 



TEACHER BEHAVIOR 
5* Partial 



1234567N 



6. Autocratic 1234567N 



7. Aloor 



1234567N 



8. Restricted 1234567N 



9. Harsh 
10. Dull 



1234567K 



123456 7 H 



11. Stereotyped 1234667 W 

12. Apathetic 1234567N 

13. Unteoreasive 1 2 3 4 5 6 7 1 



14 * Evading 

15. Erratic 

16. Excitable 

17. Uncertain 



1234567W 
i 2 3 4 5 6 7 N 
1234S67N 



123456 7 K 



18*. Disorganized 1234567 N 

19. Inflexible 123 4 567N 

20. Pessimistic 1 2 3 4 5 6 7 N 



21. Immature 

22. Harrow 



1234567N 

1234567H 



Fair 



Democratic 

n 

Responsive 



Understanding 

m 

Kindly 



Stimulating^ 

Original 



Alert 



Attractive 

C 

Responsible 

Steady 



CW St MC Cy^P BI 



Poised^ 
Confident 



Systematic^ 

Adaptable 



Gotimistic 



Integrated^ 

Broad 
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GLOSSARY 



(To be used with Ryans* Classroom Observation Record) 

* 

PUPIL BEHAVIORS 

1- Apathet ic-Alert Pupil Behavior 
Apathetic Alert 



1. Listless 

2. Bored-acting 

3* Entered intq activities half- 
* heartedly 
4 . Restless * 

5* Attention wandered 
6, Slow in getting under way 



!• Appeared anxious to recite and 
participate 

2* Watched teacher attentively 
3* Worked concentratedly 

4, Ssemed to respond eagerly 

5, Prompt and ready tc take part in 
activities when they begin 



2. Obstructive-Responsible Pupil Behavior 
Obstructive Responsible 



1* Rude to one another and/or to 
teacher 

2. Interrupting; demanding attrition; 
disturbing 

3. Obstinate; sullen 

4. Refusal to participate 

5. Quarrelsome; irritable 

6. Engaged in name-calling and/or 
tattling 

7* Unprepared 



3* Uncertain-' 
Uncertain 



1* Seemed afraid to try; unsure 
2. Hesitant; restrained 
3* Appeared embarrassed 
4* Frequent display of nervous habits, 
nail-biting, etc* 

5* Appeared shy and timid 
6* Hesitant and/or stammering speech 



1* Courteous, cooperative, friendly 
with each other and with teacher 
2, Completed assignments without 
complaining or unhappiness 
3« Controlled voices 
4, Received help and criticism atten- 
tively 

5* Asked for halo when needed 
6« Orderly without specific directions 
from teacher 

7. Prepared 



Puoil Behavior 



Confident 

le Seemed anxious to try new problems 
or activities 

2* Undisturbed by mistakes 
3* Volunteered to recite 
4« Entered freely into activities 
5* Appeared relaxed 
6* Spoke with assurance 



